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Abstract 

I give an introduction to the QCD-based theory of the heavy flavor hadrons and their 
weak decays. Trying to remain at the next-to-elementary level and skip technicali- 
ties, I concentrate on the qualitative description of the most important applications 
and physical meaning of the theoretical statements. The numerical results of the 
dedicated theoretical analyses of extracting \V c b\ are given and the possibilities to 
determine \V u b\ in future are discussed. At the same time I describe in simple lan- 
guage subtle peculiarities distinguishing actual QCD of heavy quarks from naive 
quantum mechanical treatment often applied to heavy flavor hadrons. These sub- 
tleties are often mistreated. Particular attention is paid to the concept of the heavy 
quark mass and its evaluation, to the kinetic operator and the question of the 1/mg 
corrections to inclusive widths of heavy flavor hadrons. I argue that the properly 
defined b quark mass is known with a good accuracy from the bb threshold cross 
section. 
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1 Introduction 



The Standard Model (SM) has six quarks whose existence by now is reliably estab- 
lished in experiment. They are naturally allocated to three generations 






Masses of u and d quarks are only a few MeV, much smaller than Aqcd, and in most 
applications of hadron physics u and d can be considered as massless. The strange 
quark mass is about 150 MeV, that is, literally only a little smaller than Aqcd- 
Nevertheless, there is ample evidence that treating s-quark as light is justified, and 
corrections to the (light) 577(3) symmetry are reasonably small. Thus, m s lies 
essentially below the actual typical hadronic QCD scale /ihadr ~ 500 to 700 MeV. 

Those quarks Q for which tuq ^> /ihadr ~ (2 -3) Aqcd are heavy quarks. The 
sixth t quark is the heaviest, m t ~ 170 GeV. However, it is too heavy. Its width 
due to the 'semiweak' decay t — > b + W + is Y t ~ 1 GeV. It decays too fast for the 
t-hadrons, the bound states with light quarks to be formed. 

The best candidate for application of the Heavy Quark Expansion (HQE) [] are 
beauty hadrons. It is heavy enough to confidently use the expansion in l/m&, yet 
it appears not too heavy for the leading corrections to the heavy quark limit to be 
negligible. 

The charmed quark c can be called heavy only with some reservations. While in 
some cases it gives a reasonable approximation, m c often appears manifestly too low 
for a quantitative treatment of charmed physics in the 1/rriQ expansion. There is no 
universal answer here, and considerable care must be exercised in such a treatment 
in every particular case. 

The hadrons we consider are composed from one heavy quark Q, and a light 
cloud: a light antiquark q, or diquark qq, together with the gluon medium and 
light quark- antiquark pairs. The role of the gluon medium is to keep all 'valence' 
constituents together, in a colorless bound state which will be generically denoted 
by Hq. Therefore, we have the following simplified picture of a heavy-flavor hadron. 
The heavy quark has a small size ~ I/tuq, and is surrounded by a static Coulomb- 
like color field Aq at small distances. Non-Abelian self-interaction slightly modifies 
the potential, but the non-linearity is driven by the coupling a s (V -1 ) /tt and is not 
significant. At larger distances the self- interaction strengthens, at R > Aq CD it is 
completely nonperturbative: the soft modes of the light fields are strongly coupled 
and strongly fluctuate. 

What happens when such a hadronic state is disturbed? In weak decays the 
standard situation is that the external (to QCD) forces like H^-bosons interact with 

lr rhere exists a commonly accepted abbreviation HQET which sounds very similar. It denotes 
the "Heavy Quark Effective Theory" , a rather specific approach extensively used for a number of 
exclusive transitions. In the past every theoretical consideration related to heavy quarks was often 
not quite consistently called HQET. The difference will be illustrated later. The term HQE was 



introduced, apparently, in [ 127 1 to distinguish a more general direct QCD-based approach. 
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the heavy quark, say, instantaneously replace the b quark by c quark generally 
changing its velocity, make it to move. Such an event excites first the typical modes 
of the heavy quark, hard gluons with k typ ~ mq. Since 

« s Otyp) ~ cn s (m Q ) < 1 

one can apply perturbation theory to describe what happens there. 

An actual decay process, nevertheless, eventually runs into the strong-interaction 
nonperturbative domain of k ~ uj ~ Aqcd, where u denotes the characteristic fre- 
quencies. The final hadronization dynamics shaping the hadrons we observe in 
experiment, is a result of soft nonperturbative physics which is responsible for con- 
finement. It is important that the complicated final state dynamics involve u -C tuq. 
This means that to deal with nonperturbative effects one can safely use a nonrela- 
tivistic expansion for the heavy quark. 

The main subject of the HQE is nonperturbative physics. The reason is twofold: 
first, the perturbative corrections are straightforward. The actual computations are 
often cumbersome and, beyond the first-order effects, typically require sophisticated 
state-of-the-art technique. Nevertheless, they are conceptually simple. 

The second reason is that, anyway the perturbative corrections are calculated in 
the full QCD rather than in the effective low-energy theory, since the part (often 
dominant) comes just from the gluon momenta k ~ mq where the nonrelativistic 
approximation is not applicable. Still, it is worth noting that the interplay of the 
perturbative and nonperturbative effects is quite nontrivial and involves theoretical 
subtleties which are not always fully respected. 

The treatment of the nonperturbative effects is a nontrivial problem, and differ- 
ent methods of QCD are used here. The basic tool for all of them in heavy quarks 
is the Wilson operator product expansion (OPE) Stating briefly, 

HQE = OPE + nonrelativistic expansion . 

Unless an analytic solution of QCD is found, these two ingredients are indispensable 
for the heavy quark theory. What do they mean and help us with? 

1.1 Nonrelativistic Expansion 

The main simplification of a nonrelativistic treatment is that the number of heavy 
quarks uq and antiquarks uq are separately conserved. Propagation of the nonrela- 
tivistic quark with usual relativistic Green function for the heavy quark contains also 
the process of the QQ pair creation if the time ordering of the vertices along the line 
is reversed somewhere. In the nonrelativistic kinematics, however, such configura- 
tions yield a power-suppressed contribution, since the virtuality of the intermediate 
state (the energy denominator, in the language of noncovariant perturbation theory 
of quantum mechanics) is of the order of 2mg. In order to observe such processes 
as real ones, it would be necessary to supply energy at least as large as to ~ 2itiq. 
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Heavy quarks can appear also in closed loops, for example, in the processes of 
the virtual gluon conversion. For heavy quarks such effects are also suppressed, 
~ k 2 /nig if the gluon momentum k is much smaller than mq. 

As a result, the field-theoretic, or second-quantized description of the heavy 
quark becomes redundant, and it is sufficient to resort to its usual quantum- mechanical 
(QM) treatment. For example, the wavefunction of a heavy flavor hadron takes the 
form 

*a [% {flight}] , (1) 

where xq is the heavy quark coordinate, xught generically represents an infinite 
number of light degrees of freedom; index a describes the heavy quark spin. 

A relativistic S — | particle has four components, i.e. a = 1...4. The non- 
relativistic spinor ^(x) has only two of them, a = 1,2 describing two spin states. 
The remaining a = {3, 4} components describe antiparticles which decouple in the 
nonrelativistic theory: 

«,) = { 'ft ) * ( t ~ T n (2) 

The nonrelativistic Hamiltonian of the spin-| particle is the well known Pauli Hamil- 
tonian: 

H Pauli = -g s A + ^ — '- + ^— . 3 

2m 2m 

The last operator was written with a coefficient appropriate for an 'elementary' 
pointlike particle. Presence of the interaction generally renormalizes its strength, 
the chromomagnetic moment of the heavy quark. In the heavy quark limit mg — > oo 
only the first term survives while the last two terms describing space propagation 
and interaction of spin with the chromomagnetic field B, disappear. An infinitely 
heavy quark is static and interacts only with the color Coulomb potential.^] In turn, 
the heavy quark is a source of a static color field independent of the heavy quark 
spin. The actual dynamics of the heavy quark spin reveal itself in full only when 
the quark interacts with quanta having large momentum \q\ > tuq. 

It is easy to illustrate the b quark spin-independence of the strong forces in the 
following way. One can imagine the QCD world where, instead of the actual b quark 
there exists a scalar spinless b quark with the same mass. The usual spin-0 B meson, 
bq would become a spin-| particle B ~ bq. The A b baryon bud having spin-| would, 
in turn become a scalar spinless bud state. Nevertheless, at mj, — > oo the properties 
of B and B or A& and A& would become identical. The two degenerate spin states 
of B are counterparts of B and B* of the actual QCD. 

2 Speaking of the Coulomb interaction in QCD we mean the interaction with the time- like 
component Aq of the color gauge potential. It differs from the simple 1/r electrodynamic potential. 
It is also customary to absorb the coupling g s into the definition of the gauge potential, and I often 
omit it from the expressions. 
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An immediate question comes to one's mind at this point: what about the 
relation between spin and statistics? In such a gedanken operation we replace spin- 
| hadrons by S = ones, and vice versa, i.e. interchange fermions and bosons. 
It is clear, however, that for the states or processes with a single heavy quark the 
statistics symmetry properties do not play a role. 

Such an independence of the strong dynamics of the heavy quark spin is called 
the heavy quark Spin Symmetry. 

The rest-mass tuq of the heavy quark also enters in a trivial way: the Hamiltonian 
merely contains (jiq + nqjinq. Since both are fixed, it is just an overall additive 
constant. For a moving quark this constant is E = J rrtq + p 2 = rriQy/1 + v 2 . 
Therefore, the actual dynamics is not affected by the concrete value of the mass tuq. 
This is a Heavy Flavor Symmetry which states, for example, the equal properties 
of charmed and beauty hadrons to the extent they both can be considered heavy 
enough. 

The heavy flavor symmetry leads also to certain scaling behavior of the transition 
amplitudes with the heavy flavor hadrons: the amplitudes depend on their velocities 
rather than on the absolute values of momenta: 

(pin pout \ 

-2-,pr* • (4) 
m Q m QI J 

Here P denote the momenta of the heavy flavor hadrons and p refer to other par- 
ticipating particles. It is important to keep in mind that such a scaling behavior is 
valid only with respect to the soft part of the interaction, and no hard gluons with 
k ~ rriQ are involved. 

The one-particle (or QM) description of the heavy quark degrees of freedom is 
the key simplification of the nonrelativistic expansion. The light cloud, however, 
still requires a full-fledged field-theoretic treatment. Even considering the static 
limit uiq — > oo where only interaction with the Coulomb field Aq remains, one 
should realize that it is 'very quantum' and strongly fluctuate, in contrast to simple 
potential QM models where the corresponding potential V(x) is merely a c- number 
function of coordinates. 

For this reason even the quantum mechanics of heavy quarks is highly nontriv- 
ial. Exploiting the symmetry properties of the heavy quark interactions does not 
require understanding of these complicated strong-interaction dynamics. This was 
the main field of applications at an early stage of theoretical development of heavy 
quark physics. The recent progress is mainly related to a better treatment of ba- 
sic properties of this complicated strongly-interacting system f] via application of 
dynamic QCD methods based on the short-distance expansion. 

3 In early papers on heavy quarks the system of soft components of quark and gluon fields 
constituting the light cloud was often negligently referred to as "brown muck" , which seems unfair. 
Already in the pre-heavy-quark era most of the impressive theoretical results leading, for example, 
to formulation of QCD as a theory of strong interactions, were related to ingenious treatment of 
this soft cloud in the world of light hadrons. 
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1.2 Operator Product Expansion 

The basic theoretical tool of the heavy quark theory is the Wilson operator product 
expansion [|IJ. It is not so easy to give a precise definition of what is OPE in a 
few understandable words. In particular, I found that many authors often associate 
with it somewhat different ideas, or even limit it to concrete technicalities. 

The idea of the OPE was formulated by K. Wilson in the late 60's, originally in 
the context of the statistical problems which are closely related to the field theories 
in Euclidean space. This idea, in general, is a separation of effects originating at 
large and small distances. The application to real physical processes in Minkowski 
space is often less transparent and technically more complicated, however it is always 
based on the same underlying concept. 

The original QCD Lagrangian 

£ = ~G%G% + Y,Q(iP- m q )q + J2Q(iP- m Q )Q = C hght + £ Q(i - m Q )Q 

1 Q Q 

(5) 

is formulated at very short distances, or, which is the same, at a very high normal- 
ization point jj, = M Q , where M is the mass of an ultraviolet regulator. In other 
words, the normalization point is assumed to be much higher than all mass scales in 
the theory, in particular, fi 3> toq. An effective theory for describing the low-energy 
properties is obtained by evolving the Lagrangian from the high scale Mq down to 
a lower normalization point fi. This means that we integrate out, step by step, 
all high-frequency modes in the theory thus calculating the Lagrangian C(fi). The 
latter is a full-fledged Lagrangian with respect to the soft modes with characteristic 
frequencies less than /i. The hard (high-frequency) modes determine the coefficient 
functions in £(fi), while the contribution of the soft modes is hidden in the matrix 
elements of (an infinite set of) operators appearing in £(/z). The value of this ap- 
proach, outlined by Wilson long ago [[]] , has become widely recognized and exploited 
in countless applications. The peculiarity of the heavy quark theory lies in the fact 
that the in and out states contain heavy quarks. Although we integrate out the field 
fluctuations with the frequencies down to \i <C tuq, the heavy quark fields them- 
selves are not integrated out since we consider the sector with nonzero heavy-flavor 
charge. The effective Lagrangian acts in this sector. Since the heavy quarks 
are neither produced nor annihilated, any sector with the given Q-number is treated 
separately from all others. 

The simplest and most familiar example of integrating out high-frequency modes 
is the low-energy four-fermion weak decay Lagrangian. The weak decays are me- 
diated by virtual exchanges of W bosons and, in principle, depend on the external 
momenta via the momentum flowing through the W boson line: 

A = — V r c6 V r ^M c (p c )7^(l-7 5 )M b (p & ) u d (p d )"/ v (l-"/ 5 )u u (p u ) — ^ 2 w n/f2 ■ (6) 

o (pb — Pc) — M W 
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The nontrivial denominator signifies propagation of a particle, the W boson. Elim- 
inating explicit W boson from the theory we are left with amplitudes generated by 
the local effective Lagrangian: 

C w = -^=V cb V* d J rf 4 xc(x)7 M (l-7 5 )6(x)rf(x)7 At (l-7 5 )M(x) + O (^rj > ( 7 ) 

provided all momenta are much smaller than M\y- 

This example is particularly simple in a few aspects: the W boson does not 
couple to gluons, and it has a large energy Mw even if it carries a small momentum. 
The OPE works perfectly in a more general situation as well: for example, one can 
integrate out hard gluons with large momenta (virtualities) even though the gluon 
field is massless, and gluons couple to each other. In spite of these complications, the 
effect of hard gluons is given by series of local operators generating simple effective 
vertices, which are more and more suppressed when their dimension increases. To 
state it differently, soft particles (fluctuations) can "look inside" a hard process only 
for the price of power suppression. 

This basic idea applies to any field theory. It brings in additional advantages for 
QCD related to the fact that QCD is a renormalizable theory. Another convenience 
is that the emerging effective operators are gauge- invariant. This reduces their 
number and leads to many useful relations among generated amplitudes. Combining 
the OPE with nonrelativistic expansion is particularly constraining. For example, 
if the heavy quark spin is switched off, there is no independent axial current, etc. 

Another feature is that in QCD the short- distance physics is governed by the 
small running coupling a s (q 2 ) and, therefore, the coefficients in front of these oper- 
ators, affected by the QCD dynamics, can be calculated in perturbation theory. 

Heavy quark physics has an intrinsic large scale tuq. Nevertheless, the majority 
of phenomenologically interesting processes are 'soft', that is, essentially dependent 
on the low-scale dynamics as well. There are a few exceptions which are the subject 
of special attention. An important progress in the heavy quark theory was reached 
identifying those processes that are genuinely 'hard' in this respect. 

2 Basics of the Heavy Quark Theory 
2.1 Effective Hamiltonian 

What are virtual particles which must be integrated out to obtain an effective theory 
suitable for heavy quarks? In the case of ordinary weak decays the answer was quite 
obvious, at least in the tree approximation: one had heavy virtual W which could 
virtually appear only for a short interval of time ~ 1/Mw allowed by the uncertainty 
principle. For a heavy quark Q its mass iriq is a large parameter, but the quark 
itself should not be eliminated completely since it is present in the initial state. 

The strategy is described in the text-books. The nonrelativistic fermion field 
Q(x) has four degrees of freedom. Two of them, $?(x) in Eq. (|2|) are nearly on-shell 
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and two xi x ) are highly virtual describing excitation of antiquarks. One needs to 
integrate out first the antiparticle fields x(x). For simplicity, we consider it in the 
rest frame. 

Integrating out the antiparticle fields can be done straightforwardly since the 
QCD Lagrangian is bilinear in the quark fields. In this way one would obtain the 
(tree-level version of the) so-called Lagrangian of HQET £hqet- It is a correct 
nonrelativistic Lagrangian up to the first order in l/mq. 

This does not complete the program, however: one yet has a full 'particle' field 
which includes all frequencies from to oo. Without taking care of this problem 
we would be in a position of a theorist in the end of the 20 's when the nonrelativistic 
QM of a charged particle interacting with the electromagnetic field had been written, 
but its usefulness was severely limited by the presence of ultraviolet (UV) divergences 
when the corrections due to quantum fluctuations were considered. Such a problem 
does not show up in the potential models where modes with k ^> Aqcd are not 
excited. However, in actual QCD all radiative corrections would diverge due to 
large momentum gluons. 

Therefore, besides the antiparticle fields one needs to integrate out hard gluons 
and the high frequency components of the heavy quark field *f>(x) itself, those for 
which | A; |, a; > fi. The scale fi is the normalization point of the effective theory. Since 
the configurations we integrate out depend on /i, the remaining effective low-energy 
theory cannot but have to be /^-dependent. 

In practice, we want to have \x <C and, actually, as low as possible. On the 
other hand, fj, must still belong to the perturbative domain. In practice this means 
that the best choice routinely adopted for applications is fi ~ a few x Aqcd, from 
0.7 to 1 GeV. All coefficients in the effective Lagrangian obtained in this way are 
functions of //. 

The nonrelativistic heavy quark expansion can be performed in somewhat dif- 
ferent ways. The best known examples are NRQCD (Nonrelativistic QCD) and 
HQET H || which are mainly used for describing exclusive heavy flavor transitions 
or static characteristics. At first glance, HQET differs from NRQCD only in the 
choice of the zero-order approximation for vtlq — > oo, by discarding even the stan- 
dard QM kinetic term p 2 /2rriQ together with the Pauli term aH/2rriQ. However, in 
higher orders in 1/mg the expansion has not been applied quite consistently until 
recently. The correct way was advocated by Korner et al. || and followed the 
text-book methods elaborated in early days of QM. It thus coincided with the ap- 
proach of NRQCD (the practical applications of the latter usually did not address 
higher orders in 1/mg where the difference emerges). Unfortunately for HQET, for 
a few years these justified suggestions were ignored and, with a delay, admitted only 
recently [|, [|. 

Another peculiarity of HQET as an effective theory is that, at the technical 
level, it attempts to "integrate out" perturbative corrections down to \i = (except 
for some special cases). While such a procedure is not justified and, rather, looks 
illegitimate, it is often possible to perform such an integration in the concrete orders 
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in perturbation theory. These inconsistencies seemed to be curable technicalities. 
However, in a number of publications 0, |9], [U], |6|, [UJ they were canonized and 
concrete computational techniques having limited applicability were promoted to 
the status of indispensable elements of HQET itself. As a result, there is no yet 
a clear understanding of what exactly is HQET as an actual quantum field theory 
(QFT), and this ambiguity is reflected in the existing definitions of some fundamental 
HQET parameters. Inaccurate treatment of such theoretical subtleties sometimes 
led to paradoxical claims. For example, it was stated || that HQET contains a 
source of the nonperturbative 1/rriQ corrections to inclusive decay widths, contrary 
to the theorem established in QCD itself. 

To summarize, in treating heavy quarks we separate all strong interaction effects 
into 'hard' and 'soft' introducing a normalization scale fi. To calculate the effect 
of the short- distance (perturbative) physics we use the original QCD Lagrangian 
Eq. (^). The soft physics is treated by the nonrelativistic Lagrangian where the 
heavy quarks are represented by the corresponding nonrelativistic fields ^>q\ 



-eft 



_p2 

' 4 M" 



^q(iip- m q )q + ^ {-m Q ip+ip Q + <p%iD <p Q 
Q Q 



tot, (aiD 



5m; 



-(DE) + a ■ {E x 7T- 7T x E} 



where 



7T 



iD 



p-A, (aiD) 2 = (an) 2 = n 2 + oB 



(8) 



(9) 



for simplicity I wrote only the tree-level //-independent coefficients. By the standard 
rules one constructs from the Lagrangian (|8|) the corresponding Hamiltonian. The 
heavy quark part takes the form 



He 



1 ^1 
-A + - (n 2 + aB) + — 



2m c 



-(DE) + a-{Exn-nxE} +0(l/m, 



(10) 

Since the external interactions (electromagnetic, weak etc.) are given in terms 
of the full QCD fields Q(x), one needs also the relation between Q(x) and ^q(x): 



{an 



1 + 7o 



Q 



7q 



Q 



i 



Q 



2m Q 



Q 



Although obtaining the nonrelativistic Lagrangian is a standard text-book pro- 
cedure, let me briefly recall it. One starts with 



C° he ^ y = Q(x)(tD-m Q )Q( 



x 



(12) 



IS 7T 



4 The leading term Aq is often omitted here. It is then implied that the time evolution operator 
Aq rather than i-Sr in the usual Schrodinger equation. This can be consistently carried 



■dt 



out through the analysis. 



9 



and factors out of the Q(x) field the "mechanical" time- dependent factor associated 
with the rest energy mq: 

Q(x) =e- im ^Q(x) . (13) 
In an arbitrary frame moving with four-velocity t> M it takes the following form: 

Q(x) = e - im Q v ^Q(x) . (14) 

Then 

iD^Qix) = e (m Q v, + vr M ) Q(x) , 7T„ = P M - m Q v, . 

The Dirac equation (i jp — mq) Q = takes the form (now the tilde on Q is omitted) 

Z ZTflrt ZTTlrt 



% % 



p p 

1 1 v 



which allows one to exclude the small low components - = ^ S -Q(x) expressing them 
via the 'large' upper components ^-^-Q(x). 

A subtlety emerges on this route that must be treated properly: at order l/rrig 
the time derivative do appears with a nontrivial coefficient depending, for example, 
on the gluon field. This can be eliminated, and the time derivative returned to its 
canonical form performing the Foldy-Wouthuysen transformation 

^(^^H 1 ^' (16) 

Why does one need this Foldy-Wouthuysen transformation? It is worth dis- 
cussing it since HQET uses the different fields 

h(x)= 1 -±^Q(x) (17) 

to all orders in I/trq to represent the effective low-energy heavy quark field. As a 
result, for example, the higher-order terms in the HQET effective Lagrangian are 
different from Eq. (|j) starting already 1 / mq (in this order they consist of the so- 
called Darwin and convection current, or LS interaction). Moreover, h(x) is not 
mass-independent but contains explicit l/nig, 1/rriQ ... terms depending on the 
external fields. 

In principle, it is not a mistake per se. If one were able to calculate all HQET 
Green functions computing the functional integral with such a HQET Lagrangian 
exactly, it wouldn't have mattered: any choice of dynamic variables is legitimate as 
soon as the proper reduction technique is applied to obtain the S'-matrix elements 
from the Green functions. In reality, such a possibility is as Utopian for HQET as for 
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full QCD. Therefore, in practice one can only rely on various expansion techniques, 
and here using the correct nonrelativistic fields incorporating the Foldy-Wouthuysen 
transformation and the Lagrangian becomes mandatory, as was realized 70 years ago. 
Let me briefly illustrate it. 

The standard approach in the heavy quark expansion uses the well-elaborated 
methods of QM operating with the QM states, in particular, in constructing the 
perturbation theory in l/mq. In QM the terms in the expansion are expressed via 
various T-products, and this holds in the standard form only if the canonical coor- 
dinates are used. Otherwise, somewhat different objects replace usual T-products. 
Moreover, the effective Hamiltonian describing the evolution of the h fields becomes, 
in a sense, non-Hermitian in higher orders in l/mq. 

On the other hand, one wants to employ the QFT methods which, in particular, 
are used in the OPE. They rely on the equations of motion for quantum field oper- 
ators. If a system is described by the Lagrangian C, one has the classical equations 
of motion 

5C 5C 

5d^(x) 8(p(x) 

However, we always use the very same equations of motion for the quantum Heisen- 
berg operators (] f(x). The validity of these operator equations is a nontrivial fact 
(recall that for any operator A the time derivative by definition is % [H, A]), and is a 
consequence of an analogue (generalization) of the Erenfest theorem in QM. This is 
true, however, only if the term with the time derivative has a canonical (coordinate- 
independent in QM, of field-independent in the field theory) form. For the HQET 
quantum fields the naive equations of motion do not hold. This general fact was 
"empirically" observed in |L2] (see also |]13fl ) in attempts to reproduce in HQET the 



•W^X-TTTTV = 0. (18) 



corrections to the inclusive decay widths calculated earlier directly in QCD. 
They required modifications of the existed strategy of obtaining higher-order l/mq 
corrections. This was interpreted as a 'subtlety in applying equations of motion' 
T2|, PL Surprisingly, the well known origin of this subtlety has not been realized. 



Not all related flaws have been eliminated, however. The expansion of the exclusive 
heavy flavor transition at order l/mq |14| suffering from the same inconsistencies 
has never been revised even though it is used nowadays by Neubert to claim the most 
accurate evaluation of the zero recoil B — > D* formfactor. This will be discussed in 
more detail in Sect. 3.5. 

Finally, when no rigorous QCD evaluation of the nonperturbative effects can be 
made, one often resorts to simplified QM constituent quark models (e.g., ISGW [[TJJ) 
where all wavefunction overlaps, expectation values and other characteristics can be 
computed. In applying these computations to the QCD system one certainly must 
use the proper relation between the QCD fields and the variables employed in the 



5 In fact, the classical equations of motion are often modified due to anomalies. This is a different 
aspect related to the UV divergences in the theory. What is discussed here has nothing to do with 
divergences and applies even to the systems with the finite number of degrees of freedom. 
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model. As was first noted, apparently, by A. Le Yaouanc |16|], this was not always 
done properly in HQET [|14| . 



Therefore, it is certain that the Foldy-Wouthuysen transformation is not an 
obsolete, mole-beaten technique used only before the advantages of the modern 
formulations of the quantum fields theories were known or appreciated in full. Using 
the standard nonrelativistic expansion also often gives a simple derivation, or at 
least transparent interpretation of many results. Missing it may easily lead to direct 
mistakes. 



2.2 Applications to Spectroscopy of Heavy Flavor Hadrons 

To illustrate the consequences of the heavy quark Hamiltonian, let us consider the 
masses of hadrons containing a single heavy quark. This is not a dynamic question 
and does not require more than just symmetry properties of H,q. It is described in 
much detail in a number of old reviews |T^j, and I can be brief. 

The mass of a hadron Hq is given by the expectation value of the Hamiltonian: 



2M 



■{Hq\7U^\Hq) 



and we have 

where we can expand 



n 



tot 



Hq 



flight + H.q + rriQ 



with 



H.q = Ho H Hi H + 

m Q m Q 



Wo = -Vq a oVq -A)(0) [{flight}] 



(19) 
(20) 

(21) 



m. 



-n 2 



m Q 
1 

8m3 



-H x 



2m c 



7T 



+ 3B) , 



-(DE) + &■ {E x 7? - 7f x E} 



(22) 



etc. (the first terms in H, 2 is called Darwin and the second is the convection current, 
or LS term). Therefore, 



M Hq =m Q + A + 



1 (H Q \7r 2 + aB\H Q ) 



2mr 



2M 



Hq 



. x , ^g)h q , oq x 
rriQ + A H — - + ... (23) 



2VTL( 



Here we introduced the notations \j? g for the expectation values of two D = 5 
heavy quark operators which will often appear in our discussion: 



/4 = (Hq\7T 2 \Hq) 



QM 



2M 



■(H q \Qti 2 Q(0)\Hq) Q ft 
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/4 = (H Q \aB\H Q ) QM = l^(H Q \ - Q l -a, u G^Q(0)\H Q ) QF T • (24) 

The physical meaning of /i^ is quite evident: the heavy quark inside Hq ex- 
periences a zitterbewegung due to its coupling to light cloud. Its average spatial 
momentum squared is /i^. The second expectation value measures the amount of 
the chromomagnetic field produced by the light cloud at the position of the heavy 
quark. In principle, the actual heavy hadron states Hq depend on uiq via the 
1/mQ-suppressed terms of the Hamiltonian. Therefore, the above expectation val- 
ues also have such terms. Often it is convenient to consider the asymptotic values at 
rriQ — > oo, and to use, correspondingly, the eigenstates of the mg — > oo Hamiltonian 
Hq + 7iii g ht- 

The value of A = linv^oo (m Hq — mq^ has the scale of Aq C d and depends on 
the state of light degrees of freedom. These states generally carry spin j. In the 
limit rriQ — > oo the heavy quark spin decouples since the spin-dependent parts are 
present only starting Hi. Thus, the heavy flavor hadrons can be classified not only 
by their total spin J but by the spin of light degrees of freedom j. It would be just 
the overall spin of a hadron in the hypothetical world with the spinless heavy quarks 
discussed in Sect. 1.1. Unless j = 0, there are two values of the total spin J = j ± |. 
The corresponding states form a 'hyperfine' multiplets and are degenerate up to 
1/rriQ corrections. They are, for example 

j = l/2 | ^ * j = 0,J=l/2 A c , A b 

For Ag-baryons all spin is carried by the heavy quark (up to 1/rriQ corrections). 
The observed spectroscopy of these states clearly supports this picture: 

M Ab — Mb — 350 MeV M Bs - M B - ^ 90 MeV 

M Ac - M D ~ 420 MeV M Ds - M D o ~ 104 MeV (25) 

These relations are easily improved including 1 /uiq terms, Eq. (^). The opera- 
tor Qif 2 Q is spin-independent and its expectation value is the same for all members 
of a hyperfine multiplet. It does not split masses inside the multiplet. 

The chromomagnetic operator Q^aGQ = —2SqB(0) depends on the heavy 
quark spin Sq and thus lifts degeneracy leading to the hyperfine splitting among the 
members of a multiplet. Restricted to a particular hyperfine multiplet, the chromo- 
magnetic field B(0) as an operator is proportional to the operator of spin of light 
degrees of freedom: 

B = c-j. 

Therefore, 

{SB) = 2c(S Q j) = c (j(J+ 1) + 1) - |) ■ (26) 
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As a matter of fact, it is easy to see that 



]T (H Q \QaBQ\H Q ) = Tr QaBQ = (27) 
h q 

always holds if the summation is performed over a hyperfine multiplet. Therefore, 
for example, 

p%(B) + 3/4(5*) = 0. (28) 

In the A b family the expectation value of b^aGb vanishes. 

So far most of the practical applications refer to B mesons; as a result, usually 
/i 2 proper denotes the expectation value of the kinetic operator just in B or B* . 
Likewise, p 2 G = p G (B) = —3p G (B*) . Experimentally, 

1 2 a 2 

M B *-M B ~ (- + l)J^L = i^46MeV. (29) 
V 3 / 2m b 3 2m b 

Neglecting the difference between Mb + Mb* and 2m b (which is formally an effect 
of higher order in 1/mb), one can write 

/4 ~ - (Af|. - Ml) ~ 0.36 GeV 2 . (30) 

In charmed mesons Md* — Mo ~ 140 MeV which agrees with the fact that this 
hyperfine splitting is proportional to l/m c . The hadron mass averaged over a hy- 
perfine multiplet, e.g. M = 3M b*+ m b j g a ff ec ^ ec i a ^ order I/trq by only the kinetic 
energy term /x^/2m&. 

In many applications one needs to know the difference between m& and m c . This 
difference is well constrained in the heavy quark expansion. For example, 

mb - mc = M B -M D +^(^-^)+^^(— 2 -^)+o(^). (31) 

\2m c 2m b J 4 \m c iti£ J \m A / 

Here /z 2 is the asymptotic expectation value of the kinetic operator, p D is the ex- 
pectation value of the Darwin term given by the local four-fermion operator and 



p = p n7r + p s is the sum of two positive non-local correlators | 18fl . As will be dis- 
cussed in the subsequent lectures, all quantities in Eq. (|3l|) but the meson masses 
depend on the normalization point which can be arbitrary, except that it must be 
much lower than m c ^- In this way we arrive at 

m b - m c ~ 3.50 GeV + 40 MeV ^ ~ °' 5 ^ V + AM 2 , I AM 2 | ^ 0.015 GeV . 

0.1 GeV^ 

(32) 

We will discuss the values of the hadronic parameters later. Now, let me mention 
that the m b — m c estimate at p^ = 0.5 GeV 2 appears to be in a good agreement with 
the separate determinations of m b and m c from the sum rules in charmonia and T. 
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Figure 1: Quark diagrams for the exclusive B — » Z)W (a) and generic (b) semileptonic decays. 



The main uncertainty in m& — m c is due to that in the value of //^. The Darwin 
term can be reasonably estimated relying on factorization |19[| ; it is of order 0.1 GeV 3 . 
The non-local positive matrix elements and p| are expected, generally speaking, 
to be of the same order. Altogether, assuming \p\ — p 3 \ ^ 0.1 GeV 3 , one arrives at 
the uncertainty in — m c due to the higher-order terms AM 2 quoted above. 



2.3 Heavy Quark Symmetry for Formfactors 

The amplitudes of the semileptonic weak b — > c decays are described by the corre- 
sponding transition formfactors. Typical semileptonic b — > c decays as they look like 
in Feynman diagrams are shown in Figs. 1. The hadronic part of the weak decay 
Hamiltonian mediating such decays is 

^wcak = / d 3 xe^c 7M (l-7 5 )^) • (33) 

It says that b with a momentum p is instantaneously replaced by the c quark with the 
momentum p — q. The resulting state hadronizes into eigenstates of the Hamiltonian 
corresponding to mq = m c , that is, must be projected onto such states. 

The space-time picture of the decay is simple for heavy quarks. At t < the 
initial b hadron is at rest and constitutes a coherent state of light degrees of freedom 
in the static field of the heavy quark. At t — the b quark emits the lepton pair 
with the momentum q and transforms into a c quark. The c quark gets the recoil 
momentum — q and starts moving with the velocity v = —q/m c . Such a state is 
not anymore an eigenstate of the Hamiltonian, and afterwards undergoes nontrivial 
evolution. The light cloud can get a coherent boost along the direction of — q and 
form again the same ground-state, or excited hadron. Alternatively, it can crack 
apart and produce a few-body final hadronic state. 

The heavy quark symmetry per se cannot help calculating the amplitudes to 
create such final states. However, it tells one that the hadronization process does 
not depend on the heavy quark spin, or on the concrete value of the mass mq but 
rather on the velocity of the final state heavy hadron. This velocity cannot change 
in the process of hadronization if only soft gluons are exchanged between the heavy 
quark and the light cloud. This independence holds, of course, only when mq is 
very large (and the final state quark does not move too fast); there are various 
l/rriq corrections at finite masses. 
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Let us consider, for example, the ground-state transition B — > D. Its amplitude 
depends on the velocity v of D, f{v 2 ). The very same function would describe also 
decays B — »■ D* , or the elastic amplitude of scattering of a photon on the b quark 
B —> B, B —> B* . Moreover, in the proper normalization /(0) = 1 holds. This fact 
follows from the conservation of the 6-quark vector current, for the amplitude at 
zero momentum transfer measures the total '6-quark charge' of the hadron n& — ng. 
Its origin is simply understood: if v — 0, the final state is not really disturbed by 
movement of the static source. A nontrivial rearrangement of the light cloud for 
B — > D is associated only with the mass-dependent terms in Tiq vanishing when 
rn Q -> oo. 

As far as I know, this physical picture and the fact of normalization of the B — > D 
and B — > D* transition amplitudes near zero recoil q = were first discussed in QCD 



by M. Shifman and M. Voloshin in 1986 [20 



Let us consider the vector fry^b current in B meson. It is described by the single 
formf actor f + (q 2 ): 

(B(p')\h,b(0)\B(p)) = f + (q 2 )(p + p'),, q^p'^-p, (34) 

(the second structure (p + p')^ is forbidden by T invariance or current conserva- 
tion). The value /+(0) measures the total 'beauty charge' of the hadron and is not 
renormalized by the strong interaction, /+(0) = 1.0 Passing to the velocities, we use 
instead of q 2 the scalar product v^v ' : 



(PP') i (<? 

yuv j = 

and 



^) = ^=1-^>1, (35) 



M B 2M B 



f + {q 2 ) = , «1) = 1 • (36) 

C,(vv') is the Isgur-Wise function. The heavy quark symmetry then states that 

/n/ Ai- wm, r/ « (Mb + Md ( , n M B -M D ( \ 
(D(v)\^b(0)\B(v)) = [^^(P + Ph -2JMZME<P-P)>) ^(vv). 

(37) 

D* differs from D only by the alignment of the c quark spin. Taking this into account 
yields 

(D*(v',e)\c llM b(0)\B(v)) = ie^Xv^iw') ^M B M* D 



(D*(v',e)\c^b(Q)\B(v)) = {e^vv' + 1) - v'^ev)} £(vv') ^M B M* D . (38) 

It is important that these relations are valid in the limit m^ c — > oo and if no short- 
distance radiative corrections were present. The corrections to the symmetry limit 
are minimal at q = (v — v', the so-called zero recoil point); numerically the value 



of the axial formfactor Fg^D*(0) ~ 0.9 pl |. The correction at arbitrary v ~ 1 are 
generally quite significant. 

6 For simplicity, I use the convention that B consists of b and q. That is, B~ is a B meson while 
B+ is B, 
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2.4 Feynman Rules at ttiq —> oo 

The Feynman rules for heavy quarks are usual propagators and vertices for nonrela- 
tivistic particles where 1/m — > 0. If the heavy quark momentum is p M = (mq + u, p) 
then 

rv = 0.^% • (39) 

Here a, a, (3 are color indeces and spinor indeces i,j take values 1 or 2. (It is often 
advantageous to keep the nonrelativistic term p 2 /2rriQ in the propagator.) These 
rules follow immediately from the static Lagrangian 

A a 

£q = tpQiDoVQ , D = d - ig s A a Q — . 

The same nonrelativistic system can be considered in the arbitrary moving frame, 
where one can write 

C v = V +i{vD)ip v . (40) 
Instead of the nonrelativistic spinor ifq one then considers the "bispinor" ip v (x) = 



2 



Q(x); the propagator is written as 



2 mn - (p - k) 2 - it 2 2 vk - ie { ' 



T 

where k = p — rriQV. Of course, such a generalization can be useful only when the 
initial and final state hadrons have different velocities, v ^ v' . In that case the 
external ('weak') source carries a large momentum q ~ tuq Sv . 

It must be noted, however, that HQET is not quite consistently formulated as 
a quantum field theory for processes where v ^ v' when the quantum radiative 
corrections are really incorporated. Any change in the velocity of a static source 
leads to actual radiation of real hadrons with momenta k all the way up to ttiq: 

dw a s {uj) /W 2 



du) uj 



v' - v) ■ (42) 



here uj denotes the radiated energy. The non-Abelian QCD dipole radiation was 
considered in |[22|| . On the other hand, effective theories are called upon to eliminate 
('integrate out') all high-momentum subprocesses. In order to do this one would 
have to 'integrate out' not only highly virtual, but also real processes, which clearly 
is not a completely legitimate procedure. 

This is a real difficulty and not a pure mathematical subtlety. HQET as an 
effective static theory was originally introduced |§ to study the nonperturbative 
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effects in the heavy flavor hadrons on the lattice. Suppose we would try indeed to 
extract physical results for the amplitudes with a nonzero velocity transfer on a fine 
enough lattice using the HQET Lagrangian P0|). Since this Lagrangian describes 
infinitely heavy quarks, we would have to get zero for any exclusive amplitude: all 
they contain a more or less universal suppression factor 

and thus must vanish at vtlq —>■ oo (fi is a hadronic mass scale serving as an infrared 
cutoff). This factor S is the square of the well known Sudakov formfactor (more 
exactly, of its nonrelativistic analogue). 

Any actual simulation would yield, of course, a final result since lattices contain 
an intrinsic UV cutoff, the inverse lattice spacing I /a. The result, therefore, would 
essentially depend on this cutoff for any observable process we would try to compute. 
It is not too surprising from a general point of view. HQET considers not only the 
sectors with different numbers of heavy quarks as completely separate. Even the 
sectors with a single heavy quark but with different velocities are different quantum 
systems for HQET, with formally different dynamic degrees of freedom, Hilbert space 
of states, etc. On the other hand, the transition amplitudes at v ^ v ' require, for 
example, calculating the 'inner' product of states in these theories. Such a procedure 
is not quite legitimate or well-defined. Hence, it does not come as a surprise that 
the results sometimes look unphysical or prove to be fallacious. A closer look at 
some formal derivations |23], reveals that the subtleties at v 7^ v' were indeed 
neglected there. 

This means that HQET per se is not a well-formulated theory for different veloc- 
ities beyond the first order in v when the gluon bremsstrahlung appears. The intro- 
duction of a "hard" factorization scale /1 necessary for any effective theory, becomes 
here a nontrivial and not always harmless procedure. A thoughtful reader notices 
at this point that the problem roots to the essentially Minkowskian nature of the 
processes with actual heavy quarks. On the other hand, in QCD the Sudakov-type 
bremsstrahlung is to a large extent universal, the fact analogous to the fundamental 
factorization of hard and soft processes in usual OPE formulated in Euclidean do- 
main. Therefore, an alternative way of consistently formulating the OPE expansion 
for these problems seems in principle possible, even when nonperturbative effects 
are addressed. 

3 Heavy Quark Expansion in Dynamics 

The heavy quark expansion resulted in many important insights into physics of 
heavy flavor hadrons. Already the simple qualitative considerations or symmetry 
relations noted in the 80's influenced both the theoretical understanding and the 
experimental strategy in studying B particles. Nevertheless, a true measure of our 
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control over QCD dynamics is, eventually, how accurately we can extract fundamen- 
tal parameters like \V c b\, \V u b\, m^, m c . . .from experimental properties of hadrons, 
or predict necessary amplitudes. At this point we need to address more dynamic 
aspects of QCD in heavy flavors. 

A major progress over the last years was made in this direction. It is impossible to 
review, even briefly, all relevant topics here. Therefore, I am forced to limit myself 
to a brief discussion of the existing situation with, probably, the most intriguing 
subjects - extracting \V c b\, \ V u b\ and 7?v An approximate - though already nontrivial 
- determination of some of these parameters was made already a decade ago. The 
accuracy and a confidence in our estimates now are at a new level. This required a 
better understanding of various subtleties of the heavy quark theory as a quantum 
field theory derived from QCD, which are absent in simplified hadronic models. 
Moreover, explaining itself the origin of controversy in the literature often requires 
going into such 'technicalities'. As a result, a consistent review of the situation 
requires at this point a more rigorous presentation of the OPE for heavy quarks, 
discussing the precise definitions and the status of determination of the relevant 
hadronic parameters, existing uncertainties, etc. - and only then application to 
heavy flavor phenomenology. 

In these lectures I am trying to avoid technicalities in favor of outlining a more 
transparent physical picture. To get a feeling of what we need to know and how 
accurately, let me start directly with the question of the precise determination of 
\V c b\ from the total semileptonic B width, the most reliable up to date method both 
theoretically and experimentally. This will naturally lead me to the problem of 
the heavy quark masses, kinetic energy etc. in the field theory approach. These 
questions will constitute a significant part of my lectures. Then we will be able to 
return to discussing practical extraction of V c b- Such a strategy will lead to a certain 
fragmentarity of the presentation, and I apologize for related inconvenience. 



3.1 Inclusive Widths 

Inclusive widths of the heavy flavor hadrons are examples of the genuine short- 
distance processes. The decays proceed at the space-time intervals ~ l/m b (more 
precisely, inverse energy release), and the widths are affected by the soft strong 
dynamics to the minimal extent. The utility of the OPE is demonstrated by the 



QCD theorem |p5| , [26] : there are no nonperturbative corrections to the inclusive 
widths of heavy flavor hadrons. It applies to all types of decays: semileptonic, 
nonleptonic, radiatively-induced (6 — > s + 7) etc. - they only must be truly inclusive. 

In practical application of this analysis there are some limitations, the so-called 
onset of quark-hadron duality must be passed. This question will be briefly ad- 
dressed later. The best situation in this respect exists for semileptonic decays, and 
we will discuss them now in detail. It should be noted that the methods of HQET 
are not applicable to the inclusive decays. These processes have an intrinsic large 
'dynamic' scale of energy release ~ rrib, and radically differ in this respect from 
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the exclusive heavy-flavor formfactors. The OPE analysis is done directly in QCD 
utilizing this large kinematic parameter. Only at the final stage when the nonper- 
turbative effects are expressed via the expectation values of the local heavy quark 
operators, one employs the nonrelativistic expansion for the b fields [25|| . 

It was realized long ago that for heavy enough quarks the inclusive widths must 
look similar to the analogous weak decay widths of an almost non-interacting muon: 



r(u — > evv) = 7T 

VP ' 1927T 3 



2 ' 




^ Mr \-7r-ai Mr +••• 



(44) 



where zq(x) is the phase space factor; the term with ai(x) accountings for the 
electromagnetic interaction was calculated already in 1956 fl27| . Similarly, 



r„( B ) = ^ } y ct f „ g|) . K (45) 

(likewise for the b —>■ u decays), where all QCD effects, both perturbative and 
nonperturbative, are lumped together in the factor k. It was not clear, though, what 
mass must be used, the quark mass m& or the mass of the decaying hadron Mb- The 
numerical difference is yet quite significant even for B meson: M%jm\ « 1.5 to 2, 
in spite of being formally a I /nib effect: 



M B -m b A QCD M Ab -M B A QCD 



^ 



nib nib Mb nib 



(46) 



The OPE analysis in QCD unambiguously states that one must use the quark masses 
here, so that the mass factors are the same in all types of beauty hadrons. In contrast 
to hadron masses, the inclusive widths of heavy flavor hadrons do not have relative 
1/niQ splitting. This result was established by Bigi, Shifman, Vainshtein and myself 
in 1992. 

Is this property expected a prioril Yes and no. In the simple constituent models 
where the binding energy is neglected, Mb = nib + m S p- It is obvious that in such 
systems the widths are determined by m\: the mass of the spectator - as long as it 
remains a spectator - does not enter. 

This consideration is, of course, of little relevance for QCD. The mass of the 
spectator quarks in B~ , B° or A& is only a few MeV and we are not interested 
in such negligible effects. The actual question is about a few hundred MeV cloud 
whose energy, at least partially, is the interaction energy E pot . In general, this E pot 
does directly affect the decay width. In the nonrelativistic models 

M B - m b + m sp + E pot + E kin + O (A 2 QCB /m Q ) , (47) 

and -Ep 0t ~ -^kin ni sp ~ A QCD , so this effect is less important than mere the rest 
energy of the spectator quark, and is routinely discarded. On the contrary, in QCD 
one can rather neglect m sp . 
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At the same time, at least a major part of the mass dependence in Eq. ( f45| ) 
comes from the phase space available for the final states (in particular, for leptons), 
which naively knows nothing about the quark mass and rather senses the overall 
hadron mass. It is this consideration that may seem to be relevant for the QCD 
picture of hadrons. It often underlies implicit questioning of the validity of the 
QCD analysis resurrecting, in one form or another, the 1/mq corrections to the 
widths [^8], |3(| ^TJ. And, nevertheless, the dependence on the binding energy 
disappears. How it happens in the language of the summation of the contributions 
of separate exclusive final states, was illustrated in |32[] using the so-called heavy 



quark transition sum rules | I8| . Here, instead, I'll try to explain the mechanism of 
the cancellation on a transparent toy example. 

Let us consider the semileptonic decay b — > c in the kinematics where leptons 
carry away a major portion of the energy release, and the final c quark is mov- 
ing slowly (this is the so-called small-velocity (SV) kinematics first suggested as a 
theoretical tool by Shifman and Voloshin in ||J). The initial b quark undergoes a 
binding color Coulomb potential V(0) which is of order Aqcd, and this, in principle, 
affects the decay probability. However, the final c quark is produced inside the very 
same potential, and thus the energy of all final states is shifted by the same amount. 
The equality of the potential binding force in the initial and final state is guaranteed 
in this case by equality of the color 'charges' of the initial b and final c quarks which, 
in turn, is a consequence of the conservation of the color flow. Thus, it is easy to 
see that whatever strong is the binding energy by the Coulomb interaction, in such 
a case it is canceled between the initial and final state interaction due to the 'color 
charge' conservation. 

The situation may seem different when one goes beyond the SV kinematics and 
the final quark is moving fast. First, the interaction of the final relativistic quark 
is not given by mere the Coulomb potential but depends on all components of the 
gauge potential. Second, the quark promptly leaves the vicinity of the point where it 
was produced, and the potential in other space locations is quite different. Moreover, 
if the interquark potential grows at large distances, whatever large rrib is chosen, the 
final quark can never escape and would remain confined with the initial light cloud. 
In contrast, for the interaction vanishing at large distances the quark momentarily 
becomes 'free'. This may naively seem to contradict OPE since no assumptions 
about the large-distance behavior of the interaction was involved in proving the 
cancellation of the binding energy effects. 

Nevertheless, the cancellation is complete in this case as well. The gauge nature 
of the QCD interaction and the conservation of the color current ensure that the 
integrated effect of the interaction with the potential is the same for slow and 
relativistic particle. A closer look at the second problem also shows that no actual 
contradiction is present. Since the act of the decay happens for a small time interval 
~ l/E re \, its probability is sensitive only to the potential at the distances ~ 1/E ie \ 
from the origin. Details of the interaction at larger distances merely determine the 
spectrum of the final states and how exactly the overall probability is allocated 



21 



over the different final states, but not the total decay probability. For example, if 
the interaction vanishes beyond a distance ~ Aqcd> the final states belong to the 
continuous spectrum being represented by plane waves with some distortion near 
the origin. If the potential is confining, there are no real quasifree decays but the 
transitions to the bound states in such a potential. The sum of these probabilities 
reproduces the decay width into the quasifree quark states. 

This picture explains the OPE result for QCD. The latter, of course, is more 
general and does not rely on an assumption about QM potential, etc. The key point 
is that the decay is almost instantaneous, its time is of the order of It is 

for this reason the total rate is determined by the potential at \x\ ~ and 
the width is expanded in local operators bO^b where Oj contain the QCD fields at 
x = and their derivatives. In essence, the perturbative corrections represent the 
expansion in the singularity of the interaction at x = (recall, V(x) oc a s /\x\), 
and the power nonperturbative corrections are related to the terms of the Taylor 
expansion of the interaction near x = 0.[] This is a quite general property and does 
not depend on the underlying nature of the interaction. 

The conservation of the color flow and the gauge nature of the QCD interaction 
of quarks leads to an additional property: the leading term in the Taylor expansion, 
the 'potential' V(0) does not appear as a result of the exact cancellation between the 
effects of the initial and final state interactions. This is a meaning of the theorem 
about the absence of 1/rriQ corrections in the inclusive widths. 

This is an example of a general QM statement known as the KLN theorem. In the 
simple words, it states that all "sufficiently inclusive" transition probabilities must 
be completely infrared (IR) insensitive.^ However, to be "sufficiently inclusive", the 
process, in general, requires a summation over the degenerate initial states as well, 
which is not realized in any experiment. The degree to which the KLN theorem is 
valid for actual "insufficiently inclusive" transitions is a priori unknown. The OPE 
- when it applies - allows to quantify this sensitivity. 

It is worth emphasizing that the absence of 1/toq corrections in the inclusive 
widths is not an automatic consequence of the KLN theorem or any other general 
QM fact. In dynamic realizations of strong interactions other than QCD - for 
example, due to exchange of scalar particles - such corrections would be present 
and depend non-universally on various kinematic details. That is why, for example, 
the summation of the exclusive semileptonic transition probabilities in the ISGW 
model [36| did not reproduce the partonic width, especially in the h — ► u decays. 

After the general illustrations given above it is easier to outline the main technical 
steps in calculating the widths: one only needs to believe that a particular inclusive 



7 It was suggested in |33| that the effects of duality violation are related to the singularities of 
the interaction at finite distances x from the origin, in the complex plane. I do not discuss this 
aspect here. 

8 I deliberately do not explain here the precise meaning of both the assumption and of the 
statement, what is IR insensitive. It can be found in the dedicated papers p3, [3j|. 
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width is indeed given by the local operators.^ Then we can write 



r ^ = 7^( H bhbb + c G b t -aGb + J2c i bO i b + ...\H b ) (48) 
lM Hb I 

where c are short- distance coefficient functions and bOib denote local operators of 
D = 6 and higher. Note that there is no independent operator of D = 4: the width 
must be a Lorentz scalar, and the only possible operator b^ a iD a b reduces to m b bb 
by the QCD equations of motion. This is the counterpart of the cancellation of the 
Coulomb potential A is the previous discussion: gauge invariance states that A 
must always enter together with do, and this combination applied to the b quark 
field is free of any dynamics to order Aqcd by virtue of equations of motion (that 
is, including the interaction of the initial b quark). 

The expectation values of the operators in Eq. fl4"8|) scale like Aqqq (more exactly, 
it can be a power of the normalization scale /i, i.e. fi d ~ 3 ). Therefore, the relative con- 
tribution to the width of higher-dimension operators scale like (Aqcd, ^) d ~ 3 / m Q~ 3 ■ 

Eq. (|48"D is not the whole story, however, since the leading operator bb describing 
the free quark decay is also affected by the nonperturbative dynamics. Its expecta- 
tion value, however, can be expanded in l/m b : 

(H b \bb\H b ) = (H b \bj b\H b ) + V \ > + 0{l/mi) = 

Zm b 

+••■)• (49) 

The expectation value of bjob is exactly unity, which completes the proof of absence 
of 1/rriQ corrections. Eq. (f4"9|) was also derived by Bigi, Shifman, Vainshtein and 

2 

myself and first given in |25|.|3 It is worth noting that the term — also has a 

b 

transparent physical meaning it is nothing but a Lorentz dilation of the decay 
due to nonrelativistic 'shivering' (so-called Fermi motion) inside the hadron through 
the binding effects. 

Including the leading corrections, the semileptonic width has the following form 
2|, H, H, H: 



2 I 2 

4 J rni 



192 ^i J M'-if - 2 V-^. S (50) 



where ellipses stand for higher order perturbative and/or power corrections, Zo and 
ai depend on m 2 /m 2 . The value of /x 2 is not yet measured in experiment. There 

9 One possibility to show it formally is described in detail, e.g. in p3| |. Historically, an important 
role here was played by paper |37| . 

10 Recently this relation was attributed by Neubert J3^] to the later papers, in particular, to E3 . 
However, there was no such a relation there. 
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exists an inequality fi 2 > p? G which essentially constraints the interval of its possible 
values. The status of p 2 will be addressed later. Irrespectively of the exact value 
of nl, the l/rn| corrections to T sl are rather small, about —5% and lead to the 
increase in the extracted value of \V c b\ by 2.5%. The impact of the higher order 
power corrections is negligible. 

A rather detailed most accurate numerical analysis of the semileptonic widths 
is given in [41], Sect. 8.1. Let me briefly summarize it. The leading-order 0(a s ) 
perturbative corrections are known from the QED calculations in muon decay J27| . 
The l/mf nonperturbative corrections were calculated in the above mentioned pa- 
pers and decrease the width by about 5%. The l/m\ corrections were evaluated in 



f42| , [43ll and are at the level of 1%. They are negligible for extracting \V c b\. The 
part of the higher-order perturbative corrections associated only with running of a s 
in the first-order loop diagrams, the so-called BLM corrections |44]| , were calculated 



to order a 2 s H3] and to all orders Their impact appeared to be small. The 

remaining, nontrivial non-BLM ('genuine') two-loop a 2 s corrections were expected 
to be moderate. They were recently evaluated in a series of papers by Czarnecki 
and Melnikov [[17]] and were confirmed to be small. These were highly nontrivial 
sophisticated technical analyses often considered unfeasible even a few years ago. A 
resummation of certain enhanced non-BLM higher-order corrections was performed 
in p2 |. At present, no significant theoretical uncertainty remains in the perturbative 
corrections to the semileptonic width. 

Evaluating the theoretical prediction we get 

W4 = 0.0419 ( BR{B Z XJ " ] ) * (™*\ * ■ U - 0.012K - °' 5 ^ 



1 



0.105 J \ t b J \ 0.1 GeV" 



/ <Wu \ / c^s iGeV - 0.336 W p 3 \ , N 

1 - 0.01 b -f+- 1 + 0.006^^ - 1 + 0.007 T . 51 

V 50MeV; I 0.02 J \ 0.1 GeV 3 / V ; 

Here m(,(/i) is a certain short- distance mass of the b quark which is the subject of 
a special discussion in the next section, p is a positive non-local correlator men- 
tioned before (its expected scale is 0.1 GeV 3 ). The main theoretical uncertainty at 
the moment resides in the value of p 2 . We arrive thus at the model-independent 
evaluation 



V 0.105 J V t b 

( x " °- 012 ^oTg^V?^) ' (1 ± °- 015pcrt ± °- 01m " ± °- 012) ' (52) 

where the last error reflects rrig 3 and higher power corrections as well as possible 
deviations from duality. The quoted uncertainty associated with m b can be viewed 
as rather conservative; this assertion will be explained in the sections to follow. 
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Similar to the treatment of T(B — > XJtv), it is straightforward to relate the 
value of \V U }\ to the total semileptonic width T(B — > X u 



|vy = 0.00465 ( mB l~ 2 XJV) J (^f ■ (1 ± 0.025 p » ± 0.03„,J . (53) 

The dependence on ^ is practically absent here. The theoretical uncertainty in 
T(B — > X u lv) is relatively insignificant. The major problem is experimental, how 
to measure BR s i(6 — > u). For a long time it seemed unfeasible. Nevertheless, a 
possibility of extracting it from experiment with a reasonable confidence is being 
discussed since recently, and may eventually yield an accurate determination of \V u i\ 
on this route. 

I'll return to the question of T s \ and \ V c b\ later in the summary. Before proceeding 
to a more theoretical issues, it seems necessary to note that rather controversial 
opinions existed in the literature regarding the reliability of calculation of T s i and, 
in particular, the impact of higher-order perturbative corrections. Often they were 
claimed to be too large or even uncontrollable [^, |45], |50|, [HJ . In reality, the apparent 



instability was associated with not quite physical formulation of the problem and 
the confusion rooted to the problem of the definition of the heavy quark mass. As a 
matter of fact, after all refinements and thorough analysis of the corrections, we are 
basically back to square one. The expression for \V c b\ given in Ref. |^T] has changed 



by less than one percent (for the same input values of the experimental parameters). 
What became clearer in the last two years is the acceptable range of the underlying 
key QCD parameters entering the heavy quark expansion, and the size of theoretical 
corrections. I cannot help mentioning in this respect that, meanwhile, the value of 



tb (at least used by theoreticians as an input) drifted from 1.32 ± 0.09 ps [52] to 
1.57 ps. 

3.2 Problem of m Q in QCD 

The heavy quark mass is certainly one of the most important starting parameters of 
the heavy quark theory. The practical necessity to know it well is illustrated by the 
fact that the decay widths of the beauty hadrons are proportional to a high power 
of the mass: 

T(B,A b ) ~ m n h , n = 5>l. (54) 

All uncertainties in m& are magnified by the factor n. On the other hand, one can 
turn vices into virtues developing 1/n expansion and performing resummation of 



the large-5 terms. This was accomplished in [32]. Let me mention only one, the 
perturbative aspect. 

In general, the perturbative series for, say, T a \ have n-enhanced terms: 

oc 1 + nd^ + n 2 d 2 ( + n 3 d 3 ( + ... (55) 
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where d\, d 2 ... ~ 1. Even at n = 5 and a moderate value of 24 = 0.1 the higher- 
order terms are rather significant to be neglected already if a 5% accuracy in V c j, is 
targeted. Fortunately, this series can be readily summed up and the remaining 



corrections are small. However, once again it requires an accurate understanding of 
what are mj and m c . 

Meanwhile, quite controversial estimates could be found in the literature for the 
numerical values of the charm and, in particular, bottom quark masses. As a result, 
the majority of the HQET applications, for example, were focused on the quantities 
which were mainly independent on 777.5. HQET, claiming to have finally clarified 
the issue of what is the heavy quark mass, actually admitted that its determination 
with a sensible accuracy was impossible in practice. 

In appeared that these 'practical' difficulties had a deep theoretical reason. 
HQET was based on the concept of the "pole" mass of the heavy quark m^ 016 . 
However, in QCD-like theories the "pole" mass is not a physical notion. Moreover, 
remaining a purely perturbative construction, it cannot theoretically be defined with 
the necessary accuracy ||54|| , the irreducible uncertainty in defining its value is of the 
order of typical hadronic scale: 

SmQ le = few units x Aq CD . (56) 

No wonder all attempts to extract this ill-defined value yielded controversial results. 



3.2.1 What is m Q l 

In quantum field theory the object we begin our work with is the Lagrangian for- 
mulated at some high scale Mq. The mass mo is a parameter in this Lagrangian; 
it enters on the same footing; the bare coupling constant a^ -* with the only 

difference being that it carries dimension. As with any other coupling, m enters 
in all observable quantities in a certain combination with the ultraviolet cutoff Mq, 
which is universal for a renormalizable theory. 

The mass parameter mo by itself is not observable, like . For calculating 
observable quantities at the scale fi <C M it is usually convenient to relate m to 
some mass parameter relevant to the scale \x. Integrating out momentum scales 
above /1 converts into a s {p) - and likewise mo into m{p). Such m(/i) is not 
something absolute since depends on \i. It is either used on the same footing as a s {ii) 
or, in the final expressions, is eliminated in favor of some suitable observable mass. 
For example, in quantum electrodynamics (QED) at low energies {i.e. E <C m e ) 
there is an obvious "best" candidate: the actual mass of an isolated electron, m e . 
In the perturbative calculations it is determined as the position of the pole in the 
electron Green function (more exactly, the beginning of the cut). The advantages 
are evident: m e is gauge- invariant and experimentally measurable. 

The analogous parameter for heavy quarks in QCD is referred to as the pole 
quark mass, the position of the pole of the quark Green function. Like m e it is gauge 
invariant. Unlike QED, however, the quarks do not exist as isolated objects (there 
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are no states with the quark quantum numbers in the observable spectrum, and the 
quark Green function beyond a given order has neither a pole nor a cut). Hence, 
m pole cannot be directly measured; m pole exists only as a theoretical construction. 

In principle, there is nothing wrong with using m pole in perturbation theory where 
it naturally appears in the Feynman graphs for the quark Green functions, scattering 
amplitudes and so on. It may or may not be convenient, depending on concrete goals. 

The pole mass in QCD is perturbatively infrared stable, order by order, like in 
QED. It is well-defined to every given order in perturbation theory. One cannot 
define it to all orders, however; the sum of the series does not converge to a definite 
number. In a sense, the pole mass is not infrared-stable nonperturbatively. Intu- 
itively this is clear: since the quarks are confined in the full theory, the best one 
can do is to define the would-be pole position with an intrinsic uncertainty of order 

~ aqcd m • 



Based on experience in QED or ordinary QM, non-existence of rn V Q Xc may seem 
counter-intuitive. Even rigorous inequalities like A > 232 MeV were claimed j53| for 
the 'fundamental HQET parameter', the difference A = Mb — rn\ ole (the flaw was 
pointed out in J54j). The confusion did not dissipated completely and still surfaces in 
the literature. In the review article on the quark masses in PDG-96 by A. Manohar it 
was stated that there existed a third, the "HQET" mass mg^ ET which differed from 
mg° le starting a 2 in QCD with massless (light) flavors f5"5(| . That unsubstantiated 
claim is erroneous; the mass used by HQET coincided with mQ le to any concrete 
order of perturbation theory. Moreover, it is not clear what normalization point 
could have had the coupling a s in that difference: the renormalizability would leave 
as an option only the mass of the light quark. . . I hope that such statements will be 
corrected in future issues of PDG. The "HQET" mass is as ill-defined as the "pole" 
mass. 

Employing the perturbation theory, we start with the free quark propagator 

G ^ = (\ j 

(mg(yu) is the parameter entering the Lagrangian) which has a pole at p 2 = TOq(/V) 
and, therefore, describes a particle with mass mQ(fi). Accounting for the gluon 
exchanges to the first order in a s adds the diagram Fig. 2 and the pole moves to p 2 ~ 

{mQ{jj) + ^-f^j describing now a particle with the mass ~ rriQ(fi) + etc. 
In any order of the perturbation theory we see a quark pole and the corresponding 
particle with certain mass, differing from the mass in the Lagrangian. This mass is 
just the "pole" mass. Clearly, it depends on the order of perturbation theory one 
considers, and on a concrete version of the employed expansion: 

< } = m Q {n) Y,C n (£)( , C = 1 . (57) 



It is tempting to define the 'actual' pole mass as a sum of the series (0), and this 
used to be a standard assumption in HQET. It appears, however, that the sum 



27 



Q 

— — 



Q 



Figure 2: Perturbative diagrams leading to rcnormalization of the heavy quark mass. The 
contribution of the gluon momenta below toq expresses the classical Coulomb self-energy of the 
colored particle. The number of bubble insertions into the gluon propagator can be arbitrary 
generating corrections in all orders in a s . The factorial growth of the coefficients produces the IR 
renormalon uncertainty in TOq° 1c of order Aqcd • 

does not converge to a reasonable number and cannot be defined with the necessary 
accuracy in a motivated way, but suffers from an irreducible uncertainty of order 
Aqcd- 

Before explaining the origin of this perturbative uncertainty let me note that 
such a definition of the quark mass is nothing but equating it with the lowest eigen- 
value of the (perturbative) Hamiltonian in the sector with the number of heavy 
quarks 1. The eigenvalues of the Hamiltonian, however, are not a short-distance 
quantity, and their calculation in the perturbative expansion is not adequate. This 
is in the nice correspondence with the observed fact that no isolated heavy quark 
exists in the spectrum of hadrons which contains instead B, B**, A b , £? s ...with 
masses differing from each other by amount (9(Aq CD ). 

The physical reason behind the perturbative instability of the long-distance 
regime is the growing of the interaction strength a s . One can illustrate this in- 
stability in the following transparent way. Consider the energy stored in the chro- 
moelectric field in a sphere of radius R 3> 1/ttiq around a static color source, 

5£ C oui(R) oc f d 3 xE£ onl oc const - I . ( 58 ) 

Jl/m b <\x\<R IT K 

This energy is what one adds to the bare mass of a heavy particle to determine 
what will be the mass including the QCD interactions. The definition of the pole 
mass amounts to setting R — > oo; i.e., in evaluating the pole mass one undertakes to 
integrate the energy density associated with the color source over all space assuming 
that it has the Coulomb form. In real life the color interaction becomes strong at 
R ~ 1/Aqcd; at such distances the chromoelectric field has nothing to do with the 
Coulomb tail. Thus, one cannot include the region beyond Rq in a meaningful way. 
Its contribution which is of order Aqcd, thus, has to be considered as an irreducible 
uncertainty which is power-suppressed relative to tuq, 

s m m Q le = c ( Aqcd 
m Q \ rn Q 

Exactly this behavior is traced formally in the perturbation theory. In the non- 
relativistic regime when the internal momentum \k \ <C tuq the expression for the 
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diagram Fig. 2 is simple, 



4 f d A k 4vra s 4 rd 3 ka s 

6mQ ~ -3 y (2^^ = 3 y w • (60) 

The running of the coupling is generated by dressing the gluon propagator by virtual 
pairs and leads to 

4 rd 3 ka s (k 2 ) 

6m Q-o jzi^t- ■ ( 61 ) 



3 J 4vr 2 fc 2 

Since 



a,(A; 2 ) = a s (^)|l + ^^Mn-^| , b = fN c -^n fl (62) 

we can expand a s (k 2 ) in a power series in a s (fi 2 ) and easily find the (n+ l)-th order 
contribution to SrriQ, 

fe^ „fg^) n| /baMY (63) 
m Q 3 7r \ 27r / 

We observe that the coefficients grow factorially and contribute with the same sign. 
Therefore, one cannot define the sum of these contributions even using the trick with 
the Borel transformation. The best one can do is to truncate the series judiciously. 
An optimal truncation leaves us with an irreducible uncertainty ~ 0(Aq CD ) |56|, |57| . 
The above perturbative corrections are example of the so-called infrared renormalons 



k 2 )- 1 , 11 



This uncertainty can be quantified. A formal Borel resummation of such non- 
summable series leads to the result which literally has an imaginary part which can 
be taken as a measure of the uncertainty. The imaginary part for the series Eq. (|63D 
is 

8tt 

It became conventional to assign to the irreducible uncertainty the formal imaginary 
part divided by n. Even with this minimal choice 



Im< e = ^e 5 / 6 A^ D . (64) 



8m^ = i Immf = ^e 8 /«A& ^ 0.7A^ D • (65) 



7T 



"Q 
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Thus, the perturbative expansion per se anticipates the onset of the nonpertur- 
bative regime (the impossibility of locating the would-be quark pole to accuracy 
better than Aqcd)- Certainly, the concrete numerical value of the uncertainty in 
?^poie obtained through renormalons is not trustworthy. The renormalons do not 
represent the dominant component of the infrared dynamics. However, they are a 
clear indicator of the presence of the power-suppressed nonperturbative effects, or 
infrared instability of m po i e ; the very fact that there is a correction (9(Aq CD /7«q) is 
beyond any doubt. 
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It is worth noting that the pole mass was the first example where a quantity which 
is perturbatively infrared- stable was shown not to be stable nonperturbatively at the 
level Aq CD . The observation of Refs. [57|] gave impetus to dedicated analyses of 



other perturbatively infrared-stable observables in numerous hard processes without 
OPE, in particular, in jet physics. Such nonperturbative infrared contributions, 
linear in Aqqb/Q were indeed found shortly after in thrust and many other jet 
characteristics (for a review and a representative list of references see e.g. [fpj). 

To demonstrate that the problem of divergence of the a s series for wig 06 is far 
from being academic, let us examine how the 'perturbative' contribution to the b 
quark mass looks numerically: 

m P° le = m b (l GeV) + 5m peit (below 1 GeV) ~ 

4.55 GeV + 0.25 GeV + 0.22 GeV + 0.38 GeV + 1 GeV + 3.3 GeV + (66) 

where m b {l GeV) is the running mass at \i = 1 GeV, and 5m pert is the perturbative 
series taking account of the loop momenta from 1 GeV down to zero. It is quite 
obvious that the corrections start to blow up already in rather low orders! Their 
size is consistent with the estimate Eq. (|65|). Expressing observable infrared-stable 
quantities (e.g. the inclusive semileptonic width B — > X u iv) in terms of the pole 
mass will necessarily entail large coefficients in the a s corrections compensating for 
the explosion of the coefficients in m^ ole . I will return to this point later in Sect. 3.5. 

Summarizing, the pole mass per se does not appear in OPE for infrared-stable 
quantities. Such expansions operate with the short- distance (running) mass. Any 
attempt to express the OPE-based results in terms of the pole mass creates a prob- 
lem making the Wilson coefficients ill-defined theoretically and poorly convergent 
numerically. 

Since it is impossible to relate mq(/j) and ^g° le to the necessary accuracy, it 
is clear that either rriq 1 ' 2 or mg(/i) must be irrelevant for the 1/rriQ expansion, for 
example, for calculating the widths. Which one? This question was formulated and 
answered in early 1994 in Ref. f56j. In agreement with the qualitative discussion 



given above, the answer is: mg le is irrelevant and must be replaced everywhere by 
rriQ(n). In the OPE, the IR part of the pole mass is not related to any local operator 
QOiQ, and does not enter any observable calculable in the short-distance expansion. 
Since this IR piece does not enter observables, it cannot, in turn, be determined 
experimentally. The numerical instability of various attempts to pinpoint the value 
of mg* is a result of using the perturbative expansion for the effects originating 
from the nonperturbative domain. 



The above facts were later illustrated in detail in Ref. [60] in a concrete model for 
the higher-order corrections to the semileptonic widths obtained via the so-called 
bubble resummation of the one-loop perturbative diagrams. The corresponding 
IR renormalon uncertainty disappeared from the perturbative corrections when the 
width was expressed in terms of the short- distance heavy quark mass.Q 

11 A similar cancellation was claimed to be independently obtained afterwards also in jlo|. How- 
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It is important to note that the irrelevance of the pole mass goes beyond the 
problems with the IR renormalon contributions illustrated above. Even if there 
existed some way to define reasonably the sum of the pure perturbation series for 
mg° le , or asymptotic states with single-quark quantum numbers with finite energy 
existed in a strong-interaction theory like QCD, this mass still would have been 
inadequate for constructing the effective field theory, to the extent the difference 
with mg(yu) cannot be neglected numerically. 

A properly constructed perturbative treatment suitable for the 1 / mq expansion 
incorporates only gluons with \k\ > \i which are not 'resolved' and are included 
into the heavy quark field wavefunction corresponding to the heavy quark field 
Q( At )(x) normalized at the scale [i. The normalization point fi can be changed: 
descending from \i down to \i\ < fi one has to integrate out newly-resolved gluons 
with /ii < | A; | < fi. For example, the Coulomb field associated with such gluons 
increases the mass of the quark by the amount 



d 3 k 4 a s (k 2 ^ 
m<\%\<n 47r 2 3 p 



The pole mass clearly appears when fix — > 0. From the OPE point of view, it is 
an attempt to construct an effective theory with the normalization scale /i = 
formulated, nevertheless, still in terms of quarks and gluons. Speaking theoretically, 
one can imagine a limit of small \i which would correspond to integrating out all 
modes down to fi = in evaluation of the effective Lagrangian. It would be nothing 
but constructing the S'-matrix of the theory from which one can directly read off all 
conceivable amplitudes. Clearly, it could have been formulated in terms of physical 
mesons and baryons but not of quarks and gluons! Such inconsistencies stemmed 
from postulating in HQET technicalities like using the dimensional regularization 
as an indispensable tool to formulate the theory at the quantum level. 

In the Wilson OPE one uses mg(/i) with Aq CD <C \i <C m b . I'll illustrate later 
that just such a mass can be accurately measured in experiment. It is /i-dependent: 

dm Q {fi) 16a s (/i) 4a s (/i) fi f 2 /i 2 \ 

— A = - 7T o + °\ a s, a s— ; (68) 

d/i 9 7r 3 7r vtlq \ mq J 



the higher order perturbative corrections were computed recently [p^ ]. There are 
different schemes for defining rriQ(ii) (similar to renormalization schemes for a s ), and 
the coefficients above are generally different there. As long as a concrete scheme 
is adopted, there is no ambiguity in the numerical value of mQ(/j,). Instead of the 
HQET parameter A in QCD one has A(yu) = limm^oo Mh q — mg(/i). The value of 
A(/x) is of the hadronic mass scale if /i does not scale with rriQ. 

There exists a popular choice of a short- distance mass, the so-called MS mass 
m(/i). The MS mass is not a parameter in the effective Lagrangian; rather it is a 

ever, there is no a single calculation of the perturbative correction to the width in that paper. 
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certain ad hoc combination of the parameters which is particularly convenient in 
the perturbative calculations using dimensional regularization. Its relation to the 
perturbative pole mass is known to two loops pT| : 



< = m Q (m Q ) jl + + (1-566- 3.73) (f)V... } (69) 

At n > itiq the MS mass coincides, roughly speaking, with the running Lagrangian 
mass seen at the scale ~ \x. However, it becomes rather meaningless at /x <C niq: 

ffiQ(fi) ~ m Q (rh Q ) jl + —In — I . (70) 
{ 7T /i J 

It logarithmically diverges when n/rriQ — > 0. For this reason m(/x) is not appro- 
priate in the heavy quark theory where the possibility of evolving down to a low 
normalization point, /i <C mq, is crucial. Otherwise, for example, M# Q —fhq oc mg 
and does not stay constant in the heavy quark limit. 

The reason for this IR divergence is that the MS scheme technically attempts to 
determine the (perturbative) running of all quantities by their divergences calculated 
when the space-time dimension approaches D — 4. The divergence can emerge 
only at k —>■ or k —>■ oo. For the mass, the IR divergences are absent, and 
the dimensional regularization is sensitive only to the UV divergence at k 3> m,Q. 
Studying only 1/ (D — 4) singularities, it is in principle unable to capture the change 
of the regime at /i ^ tuq and assumes the same running in this domain. The actual 
running below ttlq is much slower. Such a mistake was made, for example, in || 
where the logarithmic running of the mass was stated for arbitrary scale. 

The properly defined short-distance masses always exhibits an explicit linear in- 
dependence similar to Eq. ( |68D at // <C tuq. The perturbative pole mass, order by 
order, would correspond to the limit jj, — > 0. However such a limit does not exist. 

3.2.2 Which mass is to be used? 

Since rriQ 0le does not exist as a well-defined mass parameter, a different, short- 
distance mass must be used. The normalization point fi can be arbitrary as long as 
/i ^> Aqcd- It does not mean, however, that all masses are equally practical, since 
the perturbative series are necessarily truncated after a few first terms. Using an 
inappropriate scale makes numerical approximations bad. In particular, relying on 
mg(mg) in treating the low-scale observables can be awkward. The following toy 
example illustrates this point. 

Suppose, we would like to exploit QCD-like methods in solving the textbook 
problem of the positronium mass, calculating the standard Coulomb binding energy. 
To use the result written in terms of the MS mass one would, first, need to evaluate 
rh e (m e ). This immediately leads to technical problems: rh e (m e ) is known only to 
the order a 2 m e ; therefore, the "theoretical uncertainty" in rh e (m e ) would generate 
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the error bars ~ a 3 m e in the binding energy, i.e. at least 0.01 eV. Moreover, without 
cumbersome two-loop calculations one would not know the binding energy even at 
a few eV level - although, obviously, it is known to a much better accuracy (up to 
a 4 m e without a dedicated field-theory analysis), and the result 



M P = 2m e (0) - + 0(a 3 m e ) (71) 

is obtained without actual loop calculations! 

Thus, working with the MS mass we would have to deal here with the "hand- 
made" disaster. The reason is obvious. The relevant momentum scale in this prob- 
lem is the inverse Bohr radius, //# — ~ cxm e . Whatever contributions emerge 
at much shorter distances, say, at ~ ra~ x , their sole role is to renormalize the 
low-scale parameters a and m e . If the binding energy is expressed in terms of these 
parameters, further corrections are suppressed by powers of /is/yU. Exploiting the 
proper low-energy parameters ct(0), m e (0) one automatically resums 'tedious' per- 
turbative corrections. In essence, this is the basic idea of the effective Lagrangian 
approach, which, unfortunately, is often forgotten. 

In the perturbative determination of mb(fi), for example, the importance of 

higher-order corrections becomes governed by rather than m&. Since 

fi ^ rrib, this is a significant improvement. 

Needless to say, it is the high-scale masses that appear directly in the processes 
at high energies. In particular, the inclusive width Z — > bb is sensitive to m^Mz)', 
using MS mass normalized at /i ~ Mz is appropriate here. On the contrary, the 
inclusive semileptonic decays b —* c£u are rather low-energy in this respect fl32| . 
and, to some extent, that is true even for b — > u. 

The construction of the running low-scale heavy quark mass suitable for the OPE 
to any order in perturbation theory can be done in a straightforward way. However, 
the Wilsonian approach implies introduction a cutoff on the momenta of the gluon 
fields. Since gluon carries color, its momentum is not a gauge-invariant quantity, 
and such a mass typically appears not gauge-invariant. More accurately, obtaining 
the same TOq(^) requires somewhat different cutoff rules in different gauges. 

Even though this is not a real problem for the theory, it is often viewed as a 
disadvantage. To get rid of this spurious problem, a manifestly gauge-invariant def- 
inition of the running mass was suggested in which is formulated only in 
terms of observables. Moreover, it is convenient for the OPE in the I/tuq expansion 
since more or less directly enters many relevant processes, e.g. heavy flavor transi- 
tions or the threshold heavy flavor production. Its relation to the MS mass is known 
with enough accuracy |p2[ . To avoid ambiguities I always use this definition unless 
other convention is indicated explicitly. 

It is important to note the following fact. In the relativistic theory for a particle 
with mass m one always has p 2 = m 2 , that is, E = \Jm 2 + p 2 . In the nonrelativistic 
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expansion, therefore, 

E = m+ f- - + ... ; (72) 

all coefficient are fixed in terms of powers of m by Lorentz invariance. As will be 
mentioned in Sect. 3.3.4, in applications to heavy quarks it is typically advantageous 
to use such a Wilsonian cutoff which preserves usual QM properties for the price 
of violating Lorentz invariance (the reasons are discussed in [0]). Then the mass 
parameters in Eq. (|72|) generally become different: 

1 ^-^2 



H Q = m ip+ip Q - <PqA <Pq + {(iD) + c G aB) - ... (73) 

even for a "quasifree" quark in the effective theory. Of course, the differences be- 
tween rrii appear only due to perturbative corrections: 

m;(/i) - m k (ii) = 0(a s /i) . 

Moreover, the differences can be calculated perturbatively and are completely free 
of any IR effects existing below the scale fi. For example, in our scheme mo(/i) ~ 
m 2{^) + | as ^f l ■ We use the mass m 2 (/i), the mass that enters the kinetic energy 
operator p 2 /2m; it is the most important mass for heavy quark transitions. 

This calculable difference between different "masses" for the same quark must be 
properly accounted for in the OPE analysis. Probably, the most obvious place where 
it is important is the QQ threshold physics discussed in Sect. 3.2.3. While in the 
short- distance expansion the free quark threshold starts at 2m (/i), the propagation 
of heavy quarks or the bound state dynamics are actually determined by m 2 (/z). The 
shift in the position of the threshold which serves as a reference point for energy in 
the nonrelativistic system must be properly taken into account. 

Concluding this section, I would like to make a side remark concerning the t- 



quark mass |4l|. The peculiarity of the t quark is that it has a significant width 
T t ~ f GeV due to its weak decay. The perturbative position of the pole in the 
propagator is, thus, shifted into the complex plane by — ^T t - The finite decay width 
of the t quark introduces a physical infrared cutoff for the infrared QCD effects [52 . 



In particular, the observable decay characteristics do not have ambiguity associated 
with the uncertainty in the pole mass discussed above. The uncertainty cancels in 
any physical quantity that can be measured. That is not the case, however, in the 
position of the pole of the t-quark propagator in the complex plane (more exactly, 
its real part). The quark Green function is not observable there, and one would 
encounter the very same infrared problem and the same infrared renormalon. The 
latter does not depend on the absolute value of the quark mass (and whether it is 
real or have an imaginary part). Thus, in the case of top, one would observe an 
intrinsic - but artificial - infrared renormalon uncertainty of several hundred MeV 
in attempts to relate the peak in the physical decay distributions to the position of 
the propagator singularity in the complex plane. 
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While this fact will pose no practical problem any time soon, it will do so in 
the future, in particular when analyzing top production at linear e + e~ colliders. In 
my opinion, there is no point in expressing the fundamental experimental findings 
in terms of unphysical parameters which, additionally, cannot be defined even the- 
oretically. The observables can be conveniently expressed in terms of a mass m t 
defined at the scale \i ~ T t that is free from the renormalon ambiguities. It can be 
defined and measured without intrinsic limitations. It is worth trying to come to 
an agreement in advance what kind of top quark mass should be listed in the PDG 
tables, and thus avoid the problems which accompanied attempts to pinpoint the 
masses of c and b quarks. 



3.2.3 The numerical values of m c and mj 

The mass of the c quark at the scale ~ m c ~ 1 GeV can be obtained from the 
charmonium sum rules [p3]| , m c (m c ) ~ 1.25 GeV. The result to some extent is 
affected by the value of the gluon condensate. To be safe, we conservatively ascribe 
a rather large uncertainty, 

m c {m c ) = 1.25 ±0.1 GeV. 

There are reasons to believe that the precision charmonium sum rules actually de- 
termine the charmed quark mass to a better accuracy. 

An accurate measurement of rrib is possible in the bb production in e + e~ anni- 
hilation. Since we want to know mj, with comparable or better absolute precision, 
both the data and calculations, at first glance, must have increased accuracy. The 
data are available, however, only below and near the threshold. Certain integrals 
(moments) of the cross section over this domain are particularly sensitive to the 
low-scale mass rrib(fi) with /i in the interval 1 to 2 GeV. A brief description of the 
method can be found in the review HI]]; the details are given in the original papers 
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The value of extracted from the fit, taken literally, has a very small un- 
certainty, and the actual error bars lie in the accuracy of theoretical formulae, in 
particular, in the perturbative corrections. Nevertheless, the leading Coulomb cor- 
rections have been resummed, and the expected dominant BLM-type corrections 
accounted for. We argued in that the uncertainty in rrib(fi) with fi « 1 GeV 
must not be worse than 30 MeV, and suggested that a ±50 MeV assessment yields 
a safe interval: 

m 6 (lGeV) = 4.64 GeV ±0.05 GeV. (74) 



The definition corresponds to the concrete renormalization scheme |)5|, [!2|, [II]] which 
we consistently use. In other words, the "one-loop" pole mass is 

m ( b 1] = 4.64 GeV +— — • 1 GeV ~ 4.83 GeV (75) 

9 7T 
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at a s = 0.336. The result is, of course, sensitive to the choice of fi. The correspond- 
ing value of A(l GeV) w 0.6 GeV. 

The heavy quark masses can be measured, in principle, by studying the distribu- 
tions in the semileptonic B decays [p£, 66]. Such analyses were undertaken recently 



p7| , E3 ] . Unfortunately, the data are not good enough yet to yield a competitive de- 
termination. On the theoretical side, there are potential problems with higher-order 
corrections due to not too large energy release mj — m c ~ 3.5 GeV and/or relatively 
small mass of the c quark. In particular, the effect of higher-order power corrections 



can be noticeable. The result of analysis reported in Refs. |)7|, g§] is compatible 
with the value (|73|) within the uncertainties. Unfortunately, the results are quoted 
for the pole mass obtained in a certain approximation and, thus, only the errors 
of the fit itself are sensible. The theoretical uncertainties were not even addressed 
consistently. 

I think that direct determinations oimb(mb) will long suffer from the uncertain- 
ties at least 100 to 200 MeV. This is a minimal scale of the second-order corrections 
in the high-energy measurements. In order to extract rrib one has to consider short- 
distance observables which essentially depend on m&. Exact calculation of the 0(a 2 s ) 
corrections with massive particles is extremely difficult. Simultaneously, the absolute 
accuracy in the measurements has to be very good for a competitive determination 
of m&. 

A much larger uncertainty in the b quark mass is often cited in the literature, and 
it would not be justified to avoid discussing this controversy . For example, Neubert 
inflates it up to ±300 MeV [[50], |51| . It is not clear where such an uncertainty comes 
from; I suspect that it merely refers to the variation of numerical values one would 
get using different schemes to define theoretically what is the heavy quark mass. It 
makes no more sense than, say, calling the difference in the values of MS and MS 
strong couplings at the same scale the uncertainty in a s . 



Some scepticism to the determination of mj, rose after Ref. |){| obtained - using 



the analysis similar to the Voloshin's [64| - a significantly lower value of m&. The 



new analysis was not quite correct, however. Its deficiencies were discussed in detail 
recently in |F(J. Among other things, it was illustrated that including the missed 
resonant contributions in the used sum rules reproduces the Voloshin's numbers. 
Nevertheless, the general sceptical attitude seems to prevail up to now, and I feel 
appropriate to present some complimentary arguments why there hardly is a room 
for large uncertainties here. The controversial estimates once again seem to be 
related to the attempts to determine nonexisting pole mass of the b quark. 

In determination of mj from e + e~ — > bb the experimental input is moments I n 
of Rb{s) which is related to the cross section. The dispersion relations equate them 
with the derivatives of the correlator of the b quark vector currents ILj at g 2 = 0: 

27r 2 T r dsRb(s) 



,„ n '" (0) = = / 
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J ds R b (s) exp| — (n+1) 



J ds R b {s) exp I -(n+1) ( ^— - 1 ) \~ ( 7G ) 



2M r 2 1 n s) 1 J dE R b ((M T(1S) + E) 2 ) e -s , A ~ m b /n 
The moments I n at large n > 10 are very sensitive to the value of m b : 



T -2n— i ,„„s 

4 a: e m b . (77) 

Moreover, the leading nonperturbative effects are described by the vacuum gluon 
condensate (G 2 ^), so these corrections are very small unless n is taken too large to 
push the relevant momentum scale ~ m b /y/n down to a hadronic scale. However, 
one has to pay the price of resumming the enhanced perturbative Coulomb correc- 
tions for such a convenience. In the potential interaction the expansion parameter in 
the nonrelativistic problem is a s /\v \ rather than a s itself, and typical \v\ ~ l/\/n. 
The Coulomb corrections also contain numerically large factors. For example, the 
resummation of the first-order Coulomb exchanges yields the factor ranging from 
10 to 50 for n — 10 to 20 (roughly speaking, it is e 3^™^ fts in the relevant domain 
a s ^fn ~ 1) []64|. Naively, without a precise resummation in the theoretical expres- 



sions for the moments one cannot get anything reasonable for the b quark mass. 
Even worse, since the effect of resummation is so large, one may naturally harbor 
doubts whether the higher-order effects left over are really harmless. In my opinion, 
all such fears are exaggerated. 

In reality, the majority of the analyses treated the sum rules for the moments not 
quite consistently from the basic theoretical perspective, regarding the role of the 
infrared effects. The OPE states that the (leading) effect of the IR QCD dynamics 
in the moments I n is given by the gluon condensate (G 2 ) only if the fixed m b is a 
short- distance mass m b (fi). The Coulomb potential interaction, on the other hand, 
is resummed in the standard QM way which is written in terms of the pole masses. 
In this case the naively used OPE statement is not applicable, and the impact of the 
IR domain on I n is much larger for large n. Keeping m^ ole fixed in the theoretical 
expressions rather adds a huge factor ~ e 2n&rn l l m b to I n , where <5m^ ole is the 
shift in the pole mass due to switching on the a s -corrections in the IR domain of 
momenta below a certain scale fi. For essentially one-loop calculations, for instance, 
5ml° le ~ ^/i. Any modification of the used potential at large distances (small 

exchanged momenta) would lead actually to a drastic change in I n (ml° le ^j . This 
would be just a reflection of the fact that the moments in terms of the short- distance 
mass remain the same, and the difference between the pole mass and m b ([i) changes. 

If instead one keeps m b (fi) fixed, the dependence of /„ on the IR part of gluon 
exchanges disappears. This can can be checked explicitly. It is important to remem- 
ber, however, that this is a remarkable property of QCD. Exchange of massless scalar 
particles, for example, can perfectly imitate the Coulomb interaction of nonrelativis- 
tic particles, but the relative sign of the Coulomb interaction and the corrections 
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to the self-energy would be the opposite for them. This remark suggests that one 
should be cautious applying conclusions drawn from ad hoc nonrelativistic models 
to the QCD analysis of the T system. 

The quantum mechanical interpretation 

Let us illustrate in the simple language the OPE statements referred to above. The 
largest and most dangerous in practice IR contribution to the pole mass is linear in 
the momentum scale, Eq. (p0| ). The expression for the mass shift, actually, is nothing 
but self-interaction |Vir(0) where Vir is the heavy quark potential mediated by the 
gauge interactions with momenta below certain /x <C |54], |56j. Yet the mass 
of the bb system includes also the same Coulomb interaction between quark and 
antiquark. Since for the colorless bb state the sum of color "charges" is zero, these 
effects cancel each other for the quanta with wavelength less than the interquark 
spacing r. Therefore, for the Fourier transform of the potential V(q) in terms of 
which 

v <® = / W? m) ' (78) 

only the components with \q \ > 1/r contribute. The softer exchanges are suppressed 
by powers of the multipole factor q 2 r 2 . 

This, of course, automatically emerges in all calculations. Let us single out the 
effect of gluon exchanges with \q\ < fi: 

™ - ~U S vii)e "^ = - v ' + r vv > ~ ■•■ • (79) 

v 0= f £Lv(i), v 2 =[ £%£v(3), ... 

J\1\<m (2tt) 3 J\q\<v (2tt) 3 fl 2 

(the minus sign reflects the fact that the second particle is an antiquark, i.e. negative 
C parity of the vector current). If quarks reside at distances much smaller than 
the soft potential is just a constant. Its sole role is only to shift the energy of all bb 
states by a constant amount —Vq. It does not affect wavefunctions and, therefore, 
does not modify the coupling of the virtual photon in e + e~ annihilation to these 
states. 

However, a constant potential cannot change the energy of the neutral system, 
whether the field is classical or quantum. It means that just the opposite shift is 
present in the sum of masses of b and b renormalized by the same gauge interaction. 
Indeed, explicit expression ( j60|) shows that 5m b amounts to a half of V , so that 
the change in the sum of masses cancels the constant term in the potential for 
closely spaced bb. Once again the IR effect distinguishing the pole mass from mj,(//) 
disappears from the short-distance quantity! 

For the potential mediated by exchanges of a hypothetical scalar 4>(x), the "scalar 
charges" of b and b would have the same sign, so their sum is nonzero. The quark 
self-energy would not cancel anymore the shift in the potential but rather double 
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it. In the OPE language this contribution to the moments I n (and, eventually, to 
tuq) would correspond to the expectation value of 2 . In QCD the operator A% is 
forbidden by gauge invariance. 

The above reasoning, while illustrating the cancellation of the IR effects present 
in the pole mass, is too naive in many aspects. If these were the only grounds for the 
QCD analysis, it would have been hardly relevant at all, at least for b particles. The 
actual space separation between quarks even in the lowest T(1S) state is not much 
smaller than Aq CD which is still strongly affected by the nonperturbative dynamics. 
For higher states the situation is even less favorable. 

Additionally, the relation between the pole mass and the constant term in the soft 
potential does not hold in QCD even in perturbation theory in high orders, where 
the definition of the potential itself becomes nontrivial (the subtle large-time effects 
in the heavy quark potential not governed by the scale of the momentum transfer 
were discussed long ago in J7I|). As a matter of fact, the heavy quark potential is 
not a truly short- distance quantity [2^] , either in coordinate or momentum space at 

r < a qcd or W\ > A QCD- 

Nevertheless, the OPE ensures that the IR dynamics affect determination of the 
short-distance mass mj(/i) even less than can be concluded from the naive potential 
description. The reason is that the existing strategy suggested already for 
charmonium (where perturbative potential treatment is out of question) studies 
genuinely short- distance inclusive quantities like moments and not energies or 
wavefunctions of individual states. This leads, actually, to a limited applicability 
of the simple potential picture for soft power corrections and to disappearance of 
unwanted complications mentioned above. For example, there are no effects ~ Aq CD 
(or /i 3 , in general, for \i <C rrib/n) in the moments I n which naively can be anticipated 
from the curvature V2 of the potential in Eq. (|79|). 

The key point is that the heavy quarks are pairly produced from a point with 
the limited energy A ~ m^/n; the pair thus exists for typical time Q t ~ 1/A. The 



quark velocity is small, v ~ yA/mfc. Even though the time separation between 
bb creation and annihilation can be large, > Aqcd, the quarks still cannot move 
apart to a significant distance, and typical r ~ v t ~ 1/y/Arrif, ~ ^fnjm^ is small 
compared to Aq C d- Their interaction with gluon fields is then only dipole, i.e. weak, 
and the pair propagates through an almost transparent medium. This applies to 
both nonperturbative field configurations and to perturbative gluons with \q \ < 1/r. 
Therefore, whatever complicated is the structure of the bound states in an interquark 
potential or behavior of the wavefunctions, the heavy quark propagation between 
the points of creation and annihilation will be perturbative. 

It is also clear that the applicability of the potential picture for calculating the 
corrections to the moments in the fiducial domain is limited. The literal potential 
description is adequate when particles reside long time at a certain distance. This 

12 In this analysis "energy" A, time t, velocity etc. are Euclidean or, actually complex. This 
does not change the qualitative reasoning described here. 
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is not the case: since the quarks are produced pointlike, they remain close to each 
other. Already the dipole interaction grows with distance, therefore the static in- 
teraction properly accounts for a relatively small 'middle' part of the lifetime of the 
pair. The period of expansion from the point of creation to the typical distances 
is not static. As a result, the naive potential estimates do not always hold literally 



0- 



The actual short- distance OPE expansion is applicable when A ~ 1/t ^> Aqcd- 
It predicts a small leading IR effect ~ (G 2 a p) / (£?m b ) . There is no a potential- 
generated contribution ~ (r 2 ) t fi 2 V 2 ~ /U 2 V2/(A 2 m fe ) which would dominate at t <C 
Aqcd- This can be interpreted as the fact that the soft IR part of the potential 
even at small distances (large spacelike momentum transfer) depends on the time 
interval and is suppressed for the potential "switched on" for only a too short period 
of time. Similarly, the above mentioned long-distance effects in the large-g potential 
V(q) associated with the large time scale nonlocalities in V do not emerge here. 

At A ^ Aqcd the literal OPE does not apply. Nevertheless one still can use 
the usual multipole expansion in space coordinates. The effect, in general, is not 
expressed in terms of local in time expectation values but depends on the time 
correlators. A simple estimate for the IR effect 

in i if cItI-:(o.())i-:(t.())) ^ 



lo ' A 3 m b 

qualitatively agrees with the OPE at A ~ Aqcd- This correction is still small. 

It is important that, although the literal OPE is justified at m b /n ^> Aqcd, the 
smallness of the IR contributions and applicability of the perturbative treatment of 
moments requires only p b ~ ^ Am b ~ m b /y/n ^> Aq C d- The energy resolution 
A = m b /n can be much better than Aq C d for large m b , but the duality between the 
short- distance expressions for the moments and their actual saturation by hadronic 
states will still hold. The dedicated discussion of the quarkonium production near 
threshold and OPE can be found in the old paper |[72|. 



After this digression in the meaning of the OPE for the moments of the spectral 
function let us return to extraction of m b . Does the above understanding help in 
practice? We know that the moments are sensitive to a short-distance mass m b (n), 
and it means that the QCD corrections to it must be moderate. The effective scale 
\i depends on n and for the fiducial range 8 — 20 used in ]64|| is 1 to 2 GeV. Then 



let us, for a moment, leave out the resummation machinery of Voloshin at all and 
attempt to determine mfc(lGeV) from simply the tree-level theoretical expressions, 
in the same range 8 to 20. The result lies in the interval from 4.58 to 4.64 GeV. 
Even without accounting for any QCD corrections one gets the correct result with 
a 50 MeV accuracy! 

If the "bare" result for m b is m b ~ 4.6 GeV, one should more consistently eval- 
uate the effect of the Coulomb resummation in I n comparing them with the (bare) 
moments evaluated with this short-distance mass. If we do this, the apparent im- 
pact of resummation changes dramatically: the resummation increases moments by 
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a factor ranging only from 3 to 5. In fact, for a more natural, higher renormaliza- 
tion point \x ~ 1.5 GeV the impact becomes even smaller and changes a little with 
variation of n. For n 3> 1 it is this n-dependence that pinpoints the value of nib- 

Of course, before incorporating the perturbative effects one cannot accurately 
check the normalization point dependence of rrib(fi) and answer the question to 
which exactly /i such a tree-level determination of m;, refers. This requires account 
for the perturbative corrections in the theoretical predictions for the moments. It is 
clear that this scale lies in the interval 1 to 2 GeV (depending on n), and is neither 
mi, n or what would lead to m^ ole . 

We see that already the tree-level computation yields the correct value of the 
short- distance mass mfe(lGeV) with a 50MeV accuracy. The precision is further 
improved by the straightforward Coulomb resummation. The existing analysis |53 
is additionally improved by incorporating the next-to-leading a™ +l /\v \ n corrections 
and the presumably dominant BLM a 2 s effects. Therefore, there is no room in sight 
for larger uncertainty in determination of m&(//). This explains why the most labor- 
consuming analysis of the next-to-leading effects in |M| which was necessary for 
getting a sensible determination of a s , led to a modest change in the numerical 
value of mi, about only ~ 15 MeV, as compared to the ten- year old analysis of 
Ref. m 



One of the problems inflating the apparent uncertainties is that most of the 
analyses of the T family attempt to determine ni^ ole . The uncertainty here, indeed, 
cannot be decreased below ~ 200 MeV. Using the MS mass, on the other hand, 
makes mandatory accounting for a 2 effects exactly, which seems unfeasible. 

The way which would manifest smallness of the perturbative corrections to ex- 
traction of mb(fi) at any stage is to follow the idea of the Wilson OPE more literally. 
Namely, one first gets rid of the high gluon momenta k > nib passing to the effective 
low-energy nonrelativistic theory of b and b. For example, apart from evolving nib 
to a lower scale, a certain finite renormalization of the 7*66 coupling is generated. 
This is typically accomplished in the existing analyses to a certain extent. However 
we should also separate out the momenta \k \ < fi from the low-energy theory, in 
particular from the potential. Their effect on the moments I n is small and, if nec- 
essary, can be restored by adding the OPE contribution in the form of (G 2 )^ ert (for 
small n) or the perturbative part of the correlators in the dipole expansion. Fitting 
the moments one would get directly nib{^)- 

Since only a relatively small domain of gluon momenta, between 1 to 1.5 GeV 
and about 4 GeV would remain, the effect of the exchanges to be computed will be 
very moderate. For example, the potential will be peeled off from both hard and 
long-range parts, the spectrum of states will be modified mildly, etc. 

As a matter of fact, a similar procedure was already done by Voloshin in early 
1995. Instead of using the literal Coulomb potential with small non-Abelian correc- 
tions, the low-momentum part was removed, 

Mr) = L £te-^V(q) (80) 
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which yielded a certain /i-dependent potential fast decreasing and oscillating at 
large distances. Since the problem with the Landau pole was absent at ji ^> Aqcd, 
it was possible to use the all-order BLM-resummed potential instead of only simple 
BLM scale fixing. The quark Green functions were calculated in this potential, and 
the theoretical expression for the moments explicitly depended on /i. Fitted ra&(/x) 
depended on /i, and the normalization point dependence of mass agreed well with 
the perturbative /i-dependence similar to Eq. (|68|). 

This analysis just led to the result Eq. ([74] ) (with the uncertainty about 20 MeV); 
this investigation, however, has not been officially published. (It was referred to 
already in paper |48[|, Ref. 23.) In the subsequent papers I always used and quoted 
this evaluation of m^l GeV); it did not actually differ from the value given in [64j 



within the stated error bars. I hope that an accurate analysis of this type will 
eventually appear. 

Completing the discussion of determination of mi, I would like to comment briefly 
on another outcome of the analysis of the near-threshold e + e~ — > bb cross section. 
The overall normalization of the moments I n is also sensitive to the value of a s at 
the scale 1 to 2 GeV, which suggests a way to accurately determine it. It was done in 
|64 | and resulted in a value on a smaller side corresponding to a s (Mz) — 0.110, with 
a very small quoted error. It is possible, however, that the actual uncertainty in the 
short- distance a s is larger here. Since T's determine a s at a relatively low scale, the 
effect of higher-order terms can be rather significant. In other words, one determines 
accurately here a certain effective coupling. Although it can be, in principle, related 
to the standard MS coupling, this relation is not known with enough precision. 
Moreover, since the MS coupling is quite unphysical being selected to satisfy only 
the requirements of convenience in technical computations, the higher-order terms 
in the relation between the two couplings are expected to be large. The perturbative 
relations can be poorly convergent - if, in particular, Aqcd happens to be on the 
higher side of the existing estimates. For this reason, in my opinion it may be 
premature yet to assign a too small error in the determination of the MS coupling 
a^ s evolved to higher energies. 

At the same time, it is important that the associated uncertainty in a s does 
not affect mfc(/i) with /i m 1.5 GeV. Other masses like (m&) or various approx- 
imations for mf ole are bound to exhibit sensitivity to it since involve a significant 
a s -dependent evolution from that intermediate value of \x. 



3.3 nl and /jlq 

The heavy quark masses m c , mt,, being the key parameters in the HQE, are to a large 
extent 'external' to the properties of the effective low-energy theory itself. There 
are two nonrelativistic heavy quark operators in the Hamiltonian; their expectation 
values Eq. ( |2"4j ) in the heavy meson B play a key role in many applications. In 
contrast to tuq, they are determined by the QCD dynamics itself. We cannot yet 
calculate theoretically their values from first principles of QCD since it would require 



42 



more or less exact solution of QCD in the strong coupling regime. Instead, we can 
try to measure them extracting from known properties of hadrons. 
The value of fj, G is known: since 

1 - % 

~ Q t> Q pv Q 



2m Q ^2 

describes the interaction of the heavy quark spin with the light cloud and causes 
the hyperfme splitting between B and B*, 

/4 ~ ^2m b {M B , - M B ) ~ - (A*f, - M§) « 0.36 GeV 2 . (81) 

In actual QCD p? G logarithmically depends on the normalization point; usual one- 
loop diagrams yield 

In the mass relation given above the operator is normalized at the scale fi ~ m^: 



±nr = -cM^Sk , Cc a M^S 1 . (83) 

m Q 2m Q \ a s {fi) J 

Evolving perturbatively to the normalization scale \i ~ 1 GeV slightly enhances the 
value of Hq(h), but this effect is numerically insignificant, and we usually neglect it. 

The kinetic expectation value ^ has not been measured yet. It enters various 
distributions in semileptonic decays or processes of the type b — > s + 7. However, 
it remains rather uncertain at the moment. First attempts to extract it from the 
semileptonic distributions were reported, however so far the outcome suffers from 
large, partially artificial uncertainties and is inconclusive. The inconsistency in 
treating the quark masses propagates to the kinetic operator; since it is a subleading 
I/tuq effect, making up for this is already more tricky than for itself. 

Historically, the first attempt to determine the average of the kinetic operator 
from the QCD sum rules was Ref. |74] where a negative (!) value w — 1 GeV 2 



was obtained. The treatment of the sum rules was not legitimate, however (the 
result was eventually retracted by the author.) Shortly after, a more thoughtful 
application of the QCD sum rules yielded [73] /i 2 ~ 0.6 GeV 2 . The prediction was 
later refined by the authors 



/4 = (0.5 ± 0.15) GeV . (84) 



Meanwhile a model-independent lower bound was established [0, [T7], ^T|, [L8| 

/i 2 > /4~0.4GeV 2 (85) 

which constrained possible values of /z 2 . It is worth emphasizing that this inequality 
takes place for /z 2 normalized at any point /1, provided [j? G is normalized at the same 
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point. For large fi it becomes uninformative; so, it is in our best interests to use it 
at \i — several units xAq CD ^ 1 GeV. 

Recently, a new QCD sum rules evaluation of /^ 2 was undertaken in |78j and 



was claimed to yield the result 0.1 ± 0.05 GeV 2 ; irrespective of the central value, the 
quoted error bars are clearly unrealistic. 

The expectation value of the kinetic operator was estimated in the quark models, 
with also a controversial spectrum of predictions: the nonrelativistic ISGW model 



was stated to predict for /i 2 the value 0.27 GeV ; the relativistic quark model |79 
gave about 0.6 GeV 2 or even slightly larger; the estimates of Ref. f8(J yield a close 
value 0.5 GeV 2 . 

The origin of the difference between the QCD sum rule evaluations [75, [7(| and 



f7T| was explained in detail in ]4T[| . While Ball et al. considered the sum rules for /z^ 
directly, the recent approach by Neubert calculated some other transition amplitude 
which, however reduces to the expectation value of the kinetic operator by equations 
of motion. In this respect, these two methods a priori are equally justified and must 
have yielded the same result provided the calculations were exact. However, the 
techniques of the QCD sum rules is based on certain approximations, and they are 
quite different for these quantities. In particular, it refers to the contribution of the 
excited states which is a kind of a theoretical background, and has to be suppressed 
as much as possible. 

Not much is known about these contributions in QCD. The simple model con- 



siderations suggest [|l[] that the bias can be more significant for the sum rules of 
|7S| than in the traditional approach by Ball et al. The general QCD sum rule 
experience also states that the sum rules operate in better environment when there 
is a continuum contribution imitating the excited states than when it is absent (as 
in the correlators considered in ||78|| ). The actual scale of associated uncertainty in 
/z 2 in each approach remains questionable. It is obvious that the actual uncertainty 
in the analysis of is grossly underestimated and in reality must be increased by 



a large factor. The error bars quoted in |76[ may also require some revision. 

Before mentioning the problem of the quark models, let me briefly discuss the 
inequality /x 2 > y? G . Its physical meaning will be explained in Sect. 3.4.1. The most 
transparent way to illustrate it is based on the sum rules for /i 2 and fi^ [ST], ffTJ : 



,.2 



/_^ c nl'l/2l ~ Z-^i ml 3/2 



2E£W3l a ' (86) 



,2 



^ =-2E^/ir+2E4j<2 } r, (87) 

" n m 

where rjfy and t!$ are the standard notations for the transition amplitudes of the 
ground state B meson to the excited 'P-wave' states with spin of the light cloud j — ~ 
and |, respectively (for explicit definition see, e.g., pip. The excitation energies 
€k are defined as e& = M H (k) — Mb- The products e^r^ are merely the matrix 
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elements of the momentum operators tt = b(iD )b(0) between the ground state and 



the corresponding excited states [18], . These sum rules belong to a whole family; 
I will need later another, so-called optical sum rule for the difference between 
the meson and the quark masses: 

A(f,) = 2he n \r^ + 2^eJr^A . (88) 



The two first sum rules immediately lead to 

'1/2 



= 9£4kSi 2 > 0. (89) 



In QCD both /x 2 and Hq depend on normalization point \x. The inequality holds for 
arbitrary fi as long as renormalization of both operators is introduced in the same 
way 

The values of r's, in particular, t\i% are not well known at present. The QCD 



sum rules |83| as well as the ISGW model jOJ predict for the lowest states t± 



/2 



0.25 to 0.3 while the corresponding e% is expected to be about 400 MeV. Assuming 
/4 « 0.4 GeV 2 we then have Q /i 2 > 0.5 GeV 2 . 



3.3.1 Kinetic operator in the nonrelativistic models 

Let me briefly describe the situation with the nonrelativistic quark models. The cal- 
culation of the average square of the spacelike momentum in the ISGW model yield 
a value close to 0.27 GeV 2 , notably smaller than the model-independent bound Q 
Eq. (]8Qf) . The reason is that in the nonrelativistic systems the chromomagnetic field 
B is much smaller than p 2 and the spin-orbital interaction is negligible. As a result, 
one additionally has the spectator spin degeneracy of the light cloud states, and 
e i/2 = £3/2, T1/2 = T3/2 for all excitations. The sum rule ( |STD would yield /j,q ~ in 
agreement with vanishing B in the hadronic scale. 

The nonrelativistic constituent quark models typically yield reasonable descrip- 
tion of actual mesons despite their admittedly simplified nature. Why does it fail 
here? The actual reason lies in an implicit assumption of what is the counterpart 
of the kinetic operator in these models. In QCD /x 2 is the expectation value of 7? 2 , 
and 7T = iD — A, with the vector gauge potential A ~ Aqcd- in the nonrelativistic 
models A <C A ~ Aqcd and is neglected, so that the identification of the covariant 
momentum with the ordinary momentum operator 

tt -> id = p , nl^ (p 2 ) (90) 

13 In the nonrelativistic oscillator model the transition amplitudes vanish for all but the first 
excitations. 

14 In the field theory where the perturbative radiative corrections are included, certain alternative 
definitions of quantum operators O^, Oq differing in subtraction of the perturbative effects, may 
modify the inequality, ft cannot be violated, however, if the radiative effects are absent or neglected, 
as in the quark models. 
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is assumed as a self-evident thing. However, it is by far not as harmless assumption 
as it might seem. Since the momentum operators commute, one has 



Quark model: [7Tj,7r fc ] = 0, (91) 

whereas in QCD 

2 

[iij^k] = iG jk = -ie jk iBi ~ --ie jM ji ■ OAGeV 2 . (92) 

The fact itself of noncommutativity of the momentum operators leads to the stated 
lower bound on /z 2 , and this noncommutativity is an exact statement of QCD. The 
nonrelativistic quark models like the ISGW model, identifying 7Tj with pi, violate 
this commutation relation. This is as dangerous as modifying the basic uncertainty 
principle of QM [xj,pk] = ih5jk- 

This, in fact, is the main problem of quark models: the chromomagnetic field is 
experimentally known to be of primary importance for the properties of the light 
cloud in B mesons, but it does not naturally fit the nonrelativistic description of 
valence quarks. 

Can the utility of the models like ISGW proved to be reasonable for the wealth 
of other hadron properties, be rescued? In my opinion, the sum rules provide such a 
possibility even without actual rebuilding the model like changing its Hamiltonian 
or explicitly introducing the gauge potential. Indeed, we do not expect the model 
to yield grossly distorted values of the transition probabilities r 2 or the excitation 
energies e&. Accounting for the relativistic spin-orbital interaction is rather expected 
to reasonably split the values of ej% and Tj% from e^) 2 and which otherwise are 
equal. The sum rule Eq. (^7]) says that, most probably, the model systematically 
overestimated the value of e^2 r i/2 and/or underestimated e^2 T 3/2 '• Let us denote 

Edtog?! 8 = (i + «)E4K ( /1r, «>o, (93) 

m n 

where a is a positive constant, presumably of order 1. Next, we assume that the 
ISGW model approximately correctly evaluated the average sum of the products 
| e ( fe ) r ( fe )| 2 # By virtue of the sum rule (|86|) it amounts to 

(3 + 2a)^e 2 |r 1 ^| 2 « ^ SGW « 0.09GeV 2 . (94) 

Eq. (H) then yields 



A - A + • 05) 

Even varying the value of a from 0.5 to 3 we get /x 2 ranging from 0.5 GeV 2 to 
0.6 GeV 2 (I put = 0.4 GeV 2 ). 

Of course, the assumptions made above are not very rigorous. Nevertheless, I 
believe that such a way to cure the deficiency of the nonrelativistic constituent quark 
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models is reasonable. It is fair to say that the value of /i 2 in the QCD-compatible 
model motivated by ISGW is (0.5 ± 0.1) GeV 2 , unless one is ready to accept that 
the ISGW model provides a strongly distorted description of B mesons. 

It is important to keep in mind that in actual QCD the expectation values of 
the kinetic operator also depend on the normalization point, the effect which is not 
present in the simple quark models. The prediction obtained in such models refer 
to a relatively low normalization point fi « 0.7 to 1 GeV. In HQET an analogue 
of /x 2 is called — Aj. If the light cloud in the hadrons were a simple QM system 
without actual gluonic degrees of freedom, then — Ai would be equal to /i 2 . The 
short- distance (perturbative) effects distinguish them making /i 2 scale-dependent. 
In contrast to /i 2 , — Ai is postulated to be a fundamental scale-independent quantity, 
from which "all perturbative contributions" are subtracted. In this respect, it is a 
direct analogue of the pole mass of the heavy quark, and is ill-defined as well. It 
makes no sense to state the exact value of — Ai since it can be chosen arbitrarily 
within a few units times Aq CD . On the contrary, /x 2 (/x) is we U defined at all /x. At 
/i ^> Aq CD its /i-dependence is given by perturbation theory [pX 



dt£(n) = 4 a a (e- 5 / 3+ln V) 

d/I 2 3 7T 




(96) 



As a matter of fact, a constructive definition of — Ai has never been given. What 
is used in the literature is an implicit assumption that one must subtract from 
/•4(a0 its formal perturbative part, that is, integrate Eq. ( |9"B"D down to ji = order 
by order in a s . It is well known that such a procedure cannot be performed correctly, 
the resulting series is divergent while its sum is meaningless and the value of — Ai 
strongly depends on the way one viciously defines the sum or truncates the series. 
To illustrate it numerically, for // = 1 GeV the series would look like 

4 Ot 

- Ai = /4(/i) - ... = /i 2 (1 GeV) - 0.12 GeV 2 - 0.36 GeV 2 - ... ; (97) 

3 7T 

the higher-order terms grow and constitute already in the low orders shifts larger 
than the quantity in question. Certainly, the number assigned to — Ai can be arbi- 
trary compared to fi G : no rigorous bound can be derived for the quantity which is 
not possible to define. 



3.3.2 Hard QCD and normalization point dependence 

The sum rules (p6|)-(p7|) express /i 2 and fj, G as the sum of observable quantities, 
products of the hadron mass differences and transition probabilities. The observable 
quantities are scale- independent. How then /i 2 and fi G happen to be /i-dependent? 

The answer is that in actual quantum field theory like QCD the sums over 
excited states are generally UV divergent when Aqcd, and not saturated by 

a few lowest states with contributions fading out fast at large as in ordinary 
QM. The contributions of states with ^> Aqcd are dual to what one calculates in 
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Figure 3: Perturbative diagrams determining the high-energy asymptotics of the heavy quark 
transition amplitudes and renormalization of the local operators. 



perturbation theory using its basic objects, quarks and gluons. The latter, of course, 
yield the continuous spectrum and the corresponding transition probabilities do not 
depend on the initial bound state. They can be evaluated perturbatively using 
isolated quasifree heavy quarks as the initial state. The final states are heavy quarks 
and a certain number of gluons and light quarks. The difference between the actual 
hadronic and quark-gluon transitions resides at low excitation energies. 

In order to make the sum rules meaningful, we must cut off the sums at some 
energy /i which then makes the expectation value /i-dependent. The simplest way 
is merely to extend the sum only up to e n < \x\ this is the convention I consistently 
use. For analytic computations it is often convenient to use the exponential factor 
e~ tn ^, which is essentially the Borel transform of the related correlation functions. 
The concrete values of //^ and p? G depend, of course, on the adopted scheme. Since 
at large /i the cutoff factors differ only in the perturbative domain, the difference 
between various renormalization schemes can be calculated perturbatively. 

The high-energy tail of the transitions to order a s is given by the quark diagrams 
in Figs. 3 with 

k 



2uj 



where (u, k) is the momentum of the real gluon. The amplitudes are just a constant 
proportional to g s , and performing the simple calculations we arrive at the first- 
order term in the evolution of Eq. (|96|) . Purely perturbatively, the continuum 
analogues of T\n and r 3 / 2 are equal and a similar 'additive' renormalization of [Mq is 
absent. 

The perturbatively obtained evolution equations (0), (|82| ) allow one to deter- 
mine the asymptotic values of ri/ 2 and r 3 / 2 at e ^> Aqcd: 



E 4K ( /1l 2 +2 E jp^A, (9f 

[i<e n <ti+A /j<e m </i+A 



E 2 |_M|2_ 2i _(«)|2 ^ _WsW \ ^ 2 |_( m )|2_ 2i (")|2 

t m\'3/2 I t n\'l/2\ ~ c> . \ ml ' 3/2 I t n\'l/2 



(99) 



48 



The minimal value of A, a duality interval depends on the considered energy. When 
/i is in the resonant zone, only a little above the first prominent resonances, A is of 
order Aq CD . At large values of /i where the spectrum is smooth, A can be taken 
smaller. 

Such asymptotic equations, even incorporating higher-order corrections, are valid 
only for large enough //, above the onset of duality where the perturbative expres- 
sions are applicable. Although formally /i 2 (/-0 and Hail 1 ) are defined at arbitrary 
fi, below a certain scale their running is completely different from the perturbative 
renormalization. In particular, they both vanish when /i — > while staying finite 
for any value of ji. It is in a sharp contrast with the HQET definition of, say, /^(/i) 
(it is called 3A2) which would exhibit a singularity at some value of \i where the 
MS coupling approaches the Landau singularity. The actual running of /^(/-O nas 
nothing to do with the formal perturbative expression — 7G/i^(/ i ) a ^ small fi. The 
same general remark refers also to the definition of the Isgur-Wise function. 

3.3.3 Where is the caveat? 

Although the inequality /i 2 (/i) > /^(aO holds for any /i, it may, or may not be 
practically informative. The value of accounting for the so-called "condensate" 
nonperturbative corrections when the perturbative effects are present is a dynamic 
question of QCD, and it cannot be answered a priori by only mathematical manip- 
ulations. The dynamic challenge emerges for the sum rules as the question at which 
scale duality sets in, that is, what is the minimal value of /i where the perturbatively 
obtained expression for the amplitudes (in particular, /i-dependence) start to hold 
for the actual hadrons to a reasonable accuracy. For example, at large enough /i the 
inequality /i 2 (/i) > /^(aO * s trivial: the kinetic expectation value is then dominated 
by the large perturbative piece while Pq{p) ~ Aq CD . At which minimal 

p the value of p%(p) is already a quantity about 0.4 GeV 2 ? We can be reasonably 
confident that at p as large as ~ 4 GeV it holds. Is it still as significant at the 
scale p ^ 1 GeV, or its value is mainly saturated at e& > m c , we do not know for 
sure. This is the basic question we must learn to determine the applicability of the 
heavy quark expansion. 

It is tempting to optimistically interprete the well-satisfies experimentally rela- 
tion 

M|,. - Ml ~ Mf . - M 2 B 

suggested by the 1 / m c h expansion as an evidence for a safe early onset of duality. It 
should be noted, however, that the same difference persists also for K* — K and even 
p — 7r splitting, which certainly cannot be attributed to a heavy quark symmetry 
and must have an independent dynamic origin. The question, therefore, remains 
open. 

If, alternatively, the saturation is delayed, one cannot count on any quantitative 
application of the heavy quark symmetry to charmed hadrons, including the sym- 
metry relations for the formfactors. I consider the question of how the sum rules 
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for [Xq (and the similar sum rules for Mb — M^ b — m^) are saturated one of the 
most topical issues in the HQE. The first step toward obtaining the answer from 
experiment has been recently done in [ 83| . The uncertainties are still large, how- 
ever. Only the leading 1/toq terms in the amplitude were considered; all they have 
l/m c corrections expected to be quite sizeable. The outcome is yet inconclusive - 
stretching the uncertainties in one direction it is possible to reasonably saturate llq 
even at as low scale as 600 MeV. The space and time limitations do not allow me 
to dwell on this issue here in any detail. It is very interesting to understand also 
on which hadronic states the significantly larger A for the A& baryon is saturated. 
Unfortunately, the necessary experimental information is practically absent here. 



3.3.4 Theoretical passions around the kinetic operator 

The question of the field-theoretic definition of the kinetic operator used to be a sub- 
ject of controversial opinions in the literature; even now some incorrect statements 
remain. It makes sense to briefly address these problems here. 

A constructive definition of an effective theory and the composite operators in 
it, or defining renormalized operators in full QCD requires eliminating the high- 
energy modes of the quark and gluon fields. It can be done in somewhat different 
ways. For instance, even fixing the normalization scale to be 11 one still can cut off 
modes higher than, say, 2fi or fi/2. Of course, the difference can be more essential. 
This is illustrated in the most transparent way on the example of the perturbative 
corrections. 

Let us consider the kinetic operator Q(iD) 2 Q(0) and keep in mind that itlq — > 
oo. The operator is perfectly defined at the semiclassical level when the gauge 
field is a fixed external (c-number) function of coordinates. However, the quantum 
fluctuations of generate divergent corrections: interacting with the gauge quanta 
the heavy quark acquires a recoil which diverges in the ultraviolet when integrated 
over the frequencies of the quanta. For example, two diagrams in Fig. 4 describe 
the order-a s corrections to the operator whose expectation value generally becomes 
nonvanishing even without any external field: 

(0(^)^(0))^ = \gl{h + h) . 
The corresponding Feynman integrals are 

r d 4 k 1 k 2 r d 4 k 3 

a ~ J (2^P Jfeg ' b ~ ~J (2^F' ( } 

The integrals are quadratically divergent in the ultraviolet and somehow must be 
cut off. 

The UV regularization can be done in the different ways. If one wants to preserve 
the QM description of the effective theory, it must be local in time and no cut over 
the timelike component of ko of the gluon momentum is allowed (a more detailed 
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Figure 4: Feynman diagrams contributing to the one-loop renormalization of the kinetic operator 
Q(iD) 2 Q. Dashed line denotes gluon, and dark box represents the operator. 



discussion can be found in We can thus insert i?(/i 2 — k 2 ) in the integrand, 

that is, cut off over the spacelike components of the gluon momentum. It would 
preserve such natural properties of the effective theory as positivity of transition 
probabilities, etc. - even for momenta up to /i. This is the regularization suggested 
and the explicit expressions I used in these lectures always refer to it. With 



in 

such a regularization 3I a = —lb, and Eq. ( P^| ) is reproduced. 

In calculating the one-loop perturbative diagrams it is often more convenient 
to impose a cutoff as a function of (Euclidean) k 2 - that is, first to perform the 
Wick rotation ko = ik±, and then add the factor $(/i 2 — k 2 ) or any other cutoff 
function of k 2 + k\. For example, this is the way the cutoff is introduced in the 
Pauli-Villars scheme which regulates the integrals subtracting the similar diagrams 
with fictitious gluons having large masses. The direct computation yields that if, 
instead of integrating over ko one first averages over the directions of Euclidean 
momentum k, then I a = —lb, and the perturbative contribution vanishes. 



This fact was noted in |6(J where the authors employed essentially the Pauli- 
Villars-type regularization and observed vanishing of the first-order renormalization 
of the kinetic operator.0 It stimulated speculations that this fact could be not acci- 
dental but have a deep reason which would ensure vanishing of the renormalization 
to all orders. These speculation were intensively lobbied for some time (see, e.g. 
j85fl). In particular, they served a ground for the hope that there existed a "proper" 
regularization scheme in which there were no perturbative corrections to the kinetic 
operator at all, and which thus would provide a possibility to define unambiguously 
an "absolute" value for — Ai, as it is desired in HQET. In reality, there were no 
reason for such a miraculous expectation. Eventually, even Neubert had to admit 
]86] that — Ai is an unphysical and indefinite parameter. 

The proponents of such a possibility blamed the physical regularization described 
above for violating Lorentz invariance - the cutoff over k allegedly is not Lorentz- 
invariant in contrast to the cutoff in k 2 . The nonvanishing renormalization was at- 
tributed to this 'defect'. Such a claim is erroneous in this context, for k 2 is as Lorentz 
invariant as k 2 since there is an intrinsic Lorentz vector in the problem, and 
k 2 = (vk) 2 — k 2 . The idea to invent a "Lorentz-invariant" (that is, f-independent) 

15 Actually, I showed in |65| that this fact holds to all orders in the Abelian theory without light 
flavors, and up to a 3 in QED with light flavors. 
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regularization is misleading since the kinetic operator itself Q {(ivD) 2 — (iD) 2 } Q(0) 
is explicitly v -dependent. To state it differently, already the HQET heavy quark 
field h v (x) itself is not Lorentz- invariant by construction Eq. (TT^), and it obeys the 
f-dependent equation of motion i{vD) h v (x) = 0. 

The question of renormalization in higher orders could have been answered calcu- 
lating the two-loop radiative corrections to the kinetic operator. However, it would 
require first to define a "proper Lorentz-invariant" cutoff procedure. As explained 
above, there is no natural way to define it in general, and it could have been done 
only in the technical way at the diagrammatic level. The naive Pauli-Villars regu- 
larization is not possible since the gluon mass violates non-Abelian current conser- 
vation; the result becomes gauge- dependent starting two loops where the difference 
with the Abelian case first energies. Moreover, the momentum itself of a color ed 
particle (gluon) is a gauge-dependent quantity. Therefore, the requirement of a 
"Lorentz-invariance" of the cutoff explicitly conflicts with gauge invariance. Physi- 
cally, the problem is quite evident: it is impossible in the non-Abelian theory with 
self-interacting gluons to separate the virtual gluons from the static Coulomb field 
of the heavy quark, and the latter is Lorentz-noninvariant. 

Thus, there is no alternative definition of the kinetic operator in QCD beyond 
the first loop which would be free of "Lorentz-noninvariance" of the physical scheme 
of Ref. @. 

An analysis of the two-loop renormalization of the kinetic operator was carried 
out recently by Neubert [|86| in a scheme which was claimed to be free of these 
problems, in the contradiction with the no-go arguments given above. A closer look 
reveals, however, that the analysis is wrong in the basic aspects. 

According to the expectations, the two-loop diagrams with the non-Abelian gluon 
interaction did not vanish for any symmetry reason and led to the divergent Feynman 
integrals which had to be regularized. To assign them a definite meaning the author 
used a dispersion integral with the cutoff over the energy of intermediate states. 
It was failed to realize that it was exactly the same regularization procedure that 
is consistently used in the physical definition criticized by the author, and thus 
should have been considered not invariant to the same extent. The claims of its 
"Lorentz-invariance" were wrong. 

More to the point, contrary to the assertions of the paper the results are gauge- 
dependent. It was forgotten that the integrated discontinuities are gauge-invariant 
only if the external color particles are on-shell and only if all diagrams were in- 
corporated. Both these requirements were not respected in the "regularization" of 
Ref. |S!J . In particular, the "vanishing" diagrams discarded for symmetry reasons - 
if regulated in the same way - would yield nonzero contributions. 

As a result, the procedure used in Ref. ISEj cannot even be considered as a 
consistent regularization scheme. Leaving aside the gauge-dependence, one cannot 
use different schemes for the order-a s and order-a^ corrections or apply different 
cutoff rules for different groups of diagrams in the single order. Moreover, it must 
be realized that the regularization of the part of non-Abelian two-loop diagrams was 
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performed in the same dispersion method as suggested in [18|], which would lead to 
non-trivial renormalization of the kinetic operator already to order a s . The whole 
analysis then makes no sense and is irrelevant even if were made consistently. 

Summarizing the "puzzle" of the perturbative renormalization of the kinetic op- 
erator, the question of whether there exists any motivated 'invariant' non-Abelian 
regularization beyond the first loop, and if yes then whether the operator is renor- 
malized in this scheme, remains open. The claims of answering it have no grounds. 
Personally, though, I do not think it has any practical relevance, and am not sure 
in the theoretical relevance either. 

A completely defined regularization scheme was suggested and discussed in (18], 



65], 0; another possibility is using lattice versions of QCD with heavy quarks. 
The latter is not "Lorentz-invariant" in the discussed sense either. Moreover, it 
manifestly violates the rotational invariance. The cutoff over the gluon momenta 
in the physical scheme we adopt does not look in high orders as simple as in the 
first loop. Moreover, it necessarily has a nontrivial form for particular graphs to 
be gauge-invariant. The renormalized operator itself is manifestly gauge-invariant 
since can be written in terms of the observable transition probabilities. Finally, the 
technical feasibility of such a renormalization scheme was demonstrated in |87|, |22fl 
where the complete two-loop renormalization of the kinetic operator was computed. 



3.3.5 Regularization and renormalization 

Continuing a more theoretical part of my lectures, I feel necessary to discuss the dif- 
ference between regularization and renormalization. The relevance of this question 
is not confined by the heavy quark theory and is more general. The confusion among 
these two notions is particularly apparent in heavy quarks, and often is the reason 
behind contradicting statements, especially regarding the role of the dimensional 
regularization. 

In a few words, the regularization is used as a technical tool to make all interme- 
diate results of the calculations finite when a finite final answer is split, in the process 
of calculations, into simpler pieces which separately are divergent. Therefore, any 
consistent regularization is possible and nothing depends on it. Renormalization is 
the actual eliminating of actual divergences constructing an effective theory with 
a real ultraviolet cutoff. All effective couplings and operators do depend on the 
renormalization procedure. I'll try to explain it here in more detail. 

QED, QCD and the SM in general are renormalizable theories (strictly speaking, 
they are perturbatively renormalizable). It means that, fixing a number of observ- 
ables corresponding to the bare parameters of the Lagrangian we can, in principle, 
determine other observables. For example, in QED we fix m e , a em (0) defined viathe 
static Coulomb potential at microscopic distances, etc. All interesting cross sections 
and probabilities can be expressed in their terms. In renormalizable theories all such 
relations are eventually finite. 

Technically, however, in perturbation theory we usually proceed indirectly. Namely, 
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the observables are calculated via the (bare) parameters of the Lagrangian, and then 
the quantum corrections associated with loop effects are typically UV divergent. 
This reflects the fact that the complete definition of the theory requires introducing 
an ultraviolet cutoff, and the bare couplings depend on this cutoff in a divergent 
way when Auv ~~ * oo. 

At the same time, the relations between the physical input parameters charac- 
terizing the theory and the bare constants are divergent as well. Renormalizability 
means that the divergencies here are exactly the same and cancel when the two 
steps are performed together. The presence of the ultraviolet cutoff in any order of 
perturbation theory affects the final result only via inverse powers of Auv, so that 
the dependence on it disappears when Auv ~^ 00 • 

Subtracting or dividing infinities is not a correct mathematical procedure per se 
and, if not done accurately, may lead to an arbitrary result. The standard way is to 
regularize the involved Feynman integrals to make them finite - though dependent 
on the UV regulator - at any stage. The regularization involves certain modifications 
of the theory - for example, introducing extra heavy particles in the Pauli-Villars 
scheme. As long as it is ensured that the final physical relations do not depend on 
them, such modifications are legitimate. The regularized expressions can be safely 
manipulated since they are well-defined numbers. The final relations appear to be 
finite and regularization- independent, all infinities must cancel. The regularization 
can be removed at the final stage. 

The regularization is thus a purely technical element inherent to the standard 
way to treat the perturbative integrals. It is related to splitting the finite, convergent 
expressions into separately divergent pieces which are merely easier to calculate. All 
regularization schemes are equally acceptable since the result does not depend on 
them. It must be stressed, however, that we speak here about the observables. 
Similar regularization procedure is often applied also to infrared problems when an 
observable is split into parts which separately can be IR divergent. 

One of the most technically convenient regularization schemes is dimensional 
regularization (DR) [RSI. Its idea is to calculate Feynman diagrams in a theory 
formulated in arbitrary dimension D where the integrals are well-defined. It has an 
additional advantage of preserving gauge-invariance at any value of the regulariza- 
tion parameter e = 4 — D. 

Quite different problems must be solved when one needs to construct an effec- 
tive theory integrating out the UV degrees of freedom above a certain scale \i. This 
procedure really modifies the theory at the scale fi and above. For example, the 
spectrum of a lattice gauge theory, while describing actual hadrons at small ener- 
gies, has little in common with the perturbative QCD at energies E > 1/a. All 
composite operators in the effective theory are well-defined and do not suffer from 
divergences. They are renormalized. Even the literal perturbative expansion does 
not require (UV) regularization in the effective theory at any step. The renormal- 
ization depends, on the contrary, on the way the cutoff is introduced. The bare 
couplings, e.g. af^ are concrete finite numbers in the effective theory, depending on 
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n . 

Since all objects of the effective theory are finite, it can be used also to regu- 
larize the perturbative calculations. Although the result explicitly depends on //, 
for the renormalizable bare Lagrangians all the dependence resides in the terms 
inversely proportional to powers of \x and disappears at /j, — > oo. Moreover, it is 
often convenient to use cx s (fj), m q (ii) etc., in particular in QCD as an intermediate 
parameters. 

However, can one really use the standard regularization techniques in the op- 
posite direction, namely, to construct an effective theory and define renormalized 
operators? The answer depends on the scheme and, in general, is "No" . The details 
of regularization are crucial here, contrary to regularization of purely perturbative 
calculations. In particular, I'll demonstrate that the most convenient DR is, as a 
rule, unsuitable for constructing renormalized operators. 

First of all, the main application of the effective theories (say, for heavy quarks) is 
nonperturbative effects. While the DR perfectly handles regularization of Feynman 
integrals generated by the perturbative expansion of the formal QCD functional 
integrals in arbitrary (complex) dimension, it is so far impossible to formulate QCD 
as a complete quantum field theory including nonperturbative effects in non-integer 
dimensions. Even if it were done, the physical point D = 4 would have been 
a point of essential singularity and had extremely complicated analytic structure 
in e reflecting various phases of the theory. It would not be possible to define 
extrapolation to D = 4 subtracting only a few simple rational functions of D. 

Thus, it is clear that the DR as a purely perturbative construction helps nothing 
in defining the nonperturbative operators in an effective theory - a task which, say, 
the lattice regularization does successfully. One can try to limit consideration to 
only the perturbative expansion. There the DR proved to be very useful in many 
aspects like applications of renormalization group. However, even perturbatively 
the DR typically fails to perform the renormalization of operators in the way nec- 
essary for the Wilson OPE. The only exception are operators which are not subject 
to (powerlike) mixing with lower-dimension operators and have only logarithmic 
anomalous dimensions. An example of such operator is the chromomagnetic oper- 
ator Oq. Their perturbative normalization is given by the (Euclidean) integrals of 
the type 

r d A k „ 1 r k 2 dk 2 9 1 . . 

The denominators are in fact somewhat different in the infrared since they include 
external particle momenta p cxt , and the integrals can be IR convergent. However, 
they are logarithmically divergent at k 2 — > oo. The DR, roughly speaking, multiplies 

■D-4 

the integrand by (k j \i ) 2 : 

= 1 (S ^ D °'h = l wmSm ^ D f ■ (102) 
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At KeD < 4 the integral becomes convergent but contains the pole 1/(D — 4). 
The DR in the perturbative calculations suggests merely to subtract from Jd(//) 
this pole. The minimal subtraction scheme of the DR, for example, subtracts the 
combination 1/(D — 4) + | — | ln47r containing the Euler-Mascheroni constant: 

= JdM - ^ + 7 - ln47r) . (103) 

In the complete perturbative calculations this UV pole must finally cancel in ob- 
servables, and ad hoc removing it when passing to J R (fi) does not change the final 
answer at D — > 4. In the renormalization procedure, on the contrary, we want to 
define such integrals themselves. The quantity J R (fi) = lirno-^ J§(fj) is finite and 
is the renormalized integral in the MS scheme. 

Does J R {fi) has a more transparent meaning? Yes, it is easy to see that it 
approximately coincides with the original integral Eq. ( |101| ) if the integration is cut 
off in the UV at k 2 ~ lj 2 : 

fk 2 <fi 2 ^k I 

J R {») = J j^y 9l ^ with jl 2 = e • /i 2 > p 2 ext (104) 

(the relation between fj, and fx is not universal and depends also on the order of 
perturbation theory). Thus, for logarithmic integrals the DR effectively works as an 
UV cutoff in the effective theory, at least in the low orders. 

Unfortunately, in all other cases the DR fails as a renormalization procedure. It 
can be seen on the example of Feynman integrals encountered in renormalization of 
the kinetic operator or mass of the nonrelativistic particle, the QCD operator G 2 ^ 
etc. which are power-divergent. Already in one loop simple power integrals of the 
types 

&=l£kfA> I^gI, (105) 



(2tt) 4 " s k 2 ' J 167T 2 

etc. appear. They are not regularized by an infinitesimal shift of the dimension. 
The expressions can be formally defined only at complex D far away D = 4, at 
KeD = 2, KeD = and so on (depending on the dimension of the operator), where 
they actually vanish. The only conceivable way to define such integrals as analytic 
functions of D is to have all them zero. This can be anticipated a priori: these 
integrals must have a positive dimension of mass which can be provided only by \x 
accompanying the coupling g s at D ^ 4. However, D = 4 is not a singular point for 
such integrals. 

A direct way to see this fact is to analyze how these 'renormalized' integrals 
would enter calculations of observables. They describe the IR part of the complete 
Feynman integrals in the domain k <ti mq. Since the latter are safely IR conver- 
gent, regularization merely does not change the IR part and, therefore, includes it 
completely. No room remains for the additional contribution of such operators in 
perturbation theory. 
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As a result, except for the special case of the logarithmic renormalization, the 
DR merely considers the perturbative integrals for the operators zero irrespective 
of any details, and this corresponds to setting the Wilsonian cutoff \i = 0. In 
reality in higher loops, for example in the kinetic operator, the resulting integrals 
by themselves are not sensibly renormalized at any D, even at KeD = 2. 

To summarize this digression into the theoretical subtleties, one should clearly 
distinguish regularization and renormalization which mean quite different things. 
Renormalization introduces nonvanishing corrections even in the finite quantities or 
theories, where regularization is not necessary. While the DR is a consistent and 
one of the most technically convenient regularization schemes, it is unsuitable for 
renormalization of composite operators in the effective theories. 



3.4 \V c b\ from the Zero Recoil Rate of B — > D*lv 

The concept of the heavy quark symmetry was very important for the evolution of 
studies of heavy flavor hadrons. The fact of the fixed normalization of the B — > 
D and B — > D* formfactors at zero recoil in the limit c — > oo was of special 
significance in applications since suggested the method for determinations of \V c b\ 
from the exclusive B — ► D* semileptonic transition near zero recoil. To this end one 
measures the differential rate, extrapolates to the point of zero recoil and gets the 
quantity |V^,.Fd*(0)|, where Fd* is the axial B — *> IvD* formfactor. Since the charm 
quark is only marginally heavy, it is very important to estimate the corrections, in 
particlar, nonperturbative. The exclusive transition amplitudes are not genuinely 
short-distance, such transitions proceed in time intervals ~ 1/Aqcd- Nevertheless, it 
turns out that the large-distance effects appear in these kinematics only suppressed 
by 1/m 2 



c,b- 



a. 



7T 



F D *(0) = 1 + O ( -± ) + 5? /m . + 8? /m3 + ... (106) 



The absence of I/tuq corrections was first noted in Ref. |2t|.0 As far as I 
understand, the reasoning used by Shifman and Voloshin was very simple: let us 
consider, for example, the vector B — > D transition at zero recoil: 

(D\c lo b\B) = (M B + M D ) (l + — - — + ...) (107) 

(short- distance effects are neglected). The relative magnitude of l/m& and l/m c 
terms is fixed since at m;, = m c all corrections must vanish identically. T-invariance, 
however, says that the coefficients for l/m c and 1/rnj, terms must be equal since B 
differs from D by only the value of the heavy quark mass. Thus, both terms must 



vanish. This observation was later studied in more detail in |90| and is often called 
Luke's theorem. It essentially improves the credibility of this method in spite of 
certain experimental difficulties. 



16 The first preprint version of the paper [p9| discussing the heavy quark symmetry in QCD was 
given to me by M. Shifman in July 1986. 
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The task of precision determination of | V c b\ from the exclusive transition requires 
a detailed dynamic analysis of various preasymptotic corrections in Eq. ( |106|) . The 
perturbative part, albeit technically complicated, is at least conceptually transpar- 
ent. The theory of power corrections is more challenging. 

The need in evaluation of the 1/mq corrections in Eq. ( |106| ) for practical pur- 
poses was realized quite early |L4[]. In these days the theory of the power corrections 



in heavy quarks was immature, our knowledge was scarce, so that it was hard to 
decide even the sign of Sy m2 . The existed opinion stated that the deviations from 
the symmetry limit must be very small, and suggestions that they could be as large 
as 10% were categorically refuted f!M. The situation as it existed by 1994, before 



the OPE-based heavy quark expansion was applied to the problem in pi , |18|1 was 
summarized in the review lectures M: 



rj A = 0.986 ± 0.006 5^ /m2 = (-2 ± 1)% (108) 

(t]a is a purely perturbative factor introduced for this transition in HQET), yielding 
Fp* ~ 0.97 with a very small error. This estimate was promoted to "one of the 
most important and, certainly, most precise predictions of HQET" . Nowadays we 
believe that the actual corrections to the symmetry limit are larger, and the central 
theoretical value lies rather closer to 0.9 pi, |l8j. After heated debates for a few 



years the estimates we suggested in April 1994 seem to be accepted in the literature. 



Regarding the perturbative calculations per se, it was later pointed out |9T| that 
the claimed improvement of the original one- loop estimate was incorrect, and the 
proper central value is rather i]a ~ 0.965; such a value was confirmed by complete 
two-loop 0(a 2 s ) computation |93| . 



The existing estimates of the power nonperturbative corrections in Fp* are based 
on the sum rules for heavy flavor transitions |2~T| , [T8|| . The validity of the sum rules 
had been questioned from various perspectives, but now is commonly accepted.^ 
The idea of the application of the sum rules to B — ► D* was to consider the "in- 
clusive" transition probability into all possible final states and not limiting them to 
only D* we are really interested in, Fig. la. Studying the zero-recoil transition we 
only fix the spacelike momentum q carried away by the lepton pair, q = 0. Since 
such a sum is an "inclusive" probability, it is a short-distance quantity and is directly 
expandable in the OPE in powers of l/m c ^] the relation is established applying the 
usual tools - analyticity and unitarity. The idea and technology of the heavy quark 



sum rules were recently reviewed in [[41] and discussed in detail in the original paper 
||18|| . Here I mainly dwell on the results. 

Schematically, the zero-recoil sum rule for the axial cryfr current responsible for 



17 Neubert, for example, claimed that the sum rules were wrong (49[ [T(J while simultaneously 
using them. It does not take long to see why those arguments are invalid. Nevertheless, so far he 
has not retracted these claims. 
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the B — > D* transition has the form pl|, [18 



D* 



+ £ 

0<ei<fi 



Ufa) 



A A 



+ o 



m. 



where 



vlfa) , t4fa)-t*cfa) 

3m 2 . 4 



1 



mi 



mi 3m c m\ ) 



(109) 



;no) 



Here Fi are the-axial current transition formfactors to excited charm states i with 
the mass Mi = Mp* + e i: and £a is a short- distance renormalization factor, Ufa) = 
T]\fa). Contributions from excitations with e higher than /x are dual to perturbative 
contributions and get lumped into the coefficient Ufa) of the unit operator, the first 
term in the right-hand side of Eq. ( |109| ) . 

The role of /i is thus two-fold: in the left-hand side it acts as an ultraviolet 
cutoff in the effective low-energy theory, and by the same token determines the 
normalization point for the local operators; simultaneously, it defines the infrared 
cutoff in the Wilson coefficients. 



The l/m,Q corrections to the sum rule A^ m3 are also known |^T|. The short- 
distance renormalization factor Ufa) is calculated perturbatively. To the first order 
it was computed in all-order BLM resummation performed in ||65|| . The most 
technically involved part of genuine 0(a 2 ) corrections to Ufa) was computed in 
PB| , and the complete result given in j57|. The overall short-distance normaliza- 
tion appears to be very small (this is the result of certain numerical cancellations), 

Va(p) : 



<£a(ji) ~ 0.99 for fx ~ 0.6 GeV with the uncertainty at a percent level. The 
Wilson coefficient for the kinetic operator was also computed to the next-to-leading 
order in the correction to it is small. 

The sum rule Eq. ( |109| ) leads to the upper bound: 



D* 



Ufa) 



0<ei<fi 



FA 



°l/m 2 



> 



> 



8m 2 . 



0.035 



111^ 



The last relation is a model-independent lower bound for the 1/m 2 corrections to 



Fd* at zero recoil |2T 



I pause here to make the following remark. The sum rule Eq. (|109|) is the most 
obvious way to see the absence of 1/m 2 IR corrections at zero recoil. The sum 
of the transition probabilities has only the calculated l/m 2 cb corrections; no 1/tuq 
corrections can appear merely because there is no nontrivial local operator with 
D = 4. The excited transition amplitudes are due to the mass- dependent terms in 
the Hamiltonian, so they start with I/ttiq; the probabilities are then only l/m 2 Q. 
A slightly different (fixed q 2 , that is, a SV rather than zero-recoil) version of this 
argument was implied by Voloshin and Shifman in mid-80's when noting the absence 
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of 1/ rriQ corrections. In is curious that the fact itself of calculability of this inclusive 
probability was not discussed in this context, however. A formal QCD derivation not 
appealing directly to an effective heavy quark Hamiltonian or heavy quark symmetry 
was given in 



11 



Such a way based on the sum rules is advantageous since clarifies the subtlety 
appearing in the presence of QCD-inherent radiative corrections: the masses used 
here must be short- distance, not pole masses. This differs from the HQET approach. 
The perturbative /i-independent factor t]a as it is defined in HQET has an infrared 
(9(AQ CD /m c ) part via the dependence on masses. Likewise the difference Fp — t]a 
(or analogous ratio) does have 0(a s /m c ) long-distance contribution merely since tja 
has it. The Luke's theorem is not valid as it is literally understood in HQET. 

One can consider the similar sum rules for q ^ 0, or for other weak currents. 
For example, for the transitions driven by the hypothetical zero-recoil pseudoscalar 
current J5 = J d 3 x (cijsb) (x) one obtains the sum rule 



(l/2m c - l/2m 6 ) 2 Ufa) - ) >0 



112) 



This is the field-theoretic analogue of the difference of the sum rules (p6|) and 
and of the inequality discussed earlier. 

Returning to Fjj* , if we want to obtain an actual estimate of the nonperturbative 
corrections rather than a bound, we need to know something about the contribution 
of the excited states in the sum rule Eq. (|109|) . Unfortunately, no model-independent 
answer to this question exists at present. The best we can do today is to assume 
that the sum over the excited states is a fraction \ °f the local term given by 
and /1q, 



£l*il 



X ^f/m 2 



;ii3) 



where on general grounds \ ~ !• The contribution of the continuum state can be 



calculated |21| , however theoretically it is expected to constitute only a small fraction 
of the sum over resonant states. Trying to be optimistic, we rather arbitrarily limit 
X by unity on the upper side; the larger is \i the smaller is Fd*. In this way we 
arrive at 



D* 



r] A {^) - (1 + x) 



/4 + /4_ 



6m^ 



1 1 



3m c mb 



°l/m 3 



;ii4) 



Assembling all pieces together we get for % = 0-5 ± 0.5 



Fn* 



0.91 - 0.013 



^_0.5GeV 2 
0.1 GeV 2 



± 0.02 cxcit ± 0.01 pert ± 0.025 1/r . 



;ii5) 



Estimates of the uncertainties in the 0(l/mq) corrections and the contributions 
from the higher excitations are not very firm and are rather on an optimistic side; 
they can be larger. With l/mn corrections amounting to ~ 8% noticeably smaller 
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1/m 3 effects can be only a result of accidental cancellations. Their more elaborated 
estimates were done in ||65|| . Altogether we get 



D* 



0.91 ±0.06, (116) 



where the optimistic uncertainty ±0.1 GeV 2 is ascribed to /j, 2 . 

Eq. (|115| ) basically coincides with the estimates given in the original paper J2TJ . 
The QCD-based analysis definitely favors a significantly larger deviation of Fd*(0) 
from unity than those en vogue three or four years ago. Trying to understand the 
reason behind the difference, I carefully studied the earlier analyses claiming smaller 
effects. They all ascend to the paper H]; its subsequent usage was somewhat too 



liberal, I would say. The analysis of |I4[ was not actually a kind of a reasonably 



accurate evaluation but rather a general discussion of the scale of possible effects not 
pretending on high precision. The effects of the relativistic spin-orbital interaction 
in light cloud were practically ignored since they did not fit the oversimplified quark 
model used for the estimates. I think that this was even more essential than the fact 
itself that the 1 /mg expansion was performed incorrectly at order 1/ttlq in [14|. It 



was only in the successive reviews by Neubert where a tentative evaluation of that 
paper was gradually promoted to the status of the accurate prediction, without new 
relevant input. 

3.4.1 Quantum-mechanical interpretation 

In QM the inequality /i 2 > p? G can be illustrated in different ways. Formally, it 
expresses the positivity of the Pauli Hamiltonian ||77|| : 



JLtfw) 2 = — f(W) 2 --aG 

More physically, it shows the Landau precession of a colored, i.e. "charged" particle 
in the (chromo)magnetic field. Let me recall that the particle Hamiltonian in the 
magnetic field looks formally like for free particle, V 2 /2m, with the only difference 
that the covariant momenta V do not commute in the presence of the magnetic 
field. This 'uncertainty relation' leads to the precession. Hence, one has (p 2 ) > \B\. 
Literally speaking, in the B* meson the quantum-mechanical expectation value of 
the chromomagnetic field is suppressed, (B z ) = — It completely vanishes in 
the B meson. However, the essentially non-classical nature of the 'commutator' B 
(e.g. (B 2 ) > 3(B) 2 ), in turn, enhances the bound which then takes the same form 
as in the external classical field. 

The basic sum rule Eq. ( |109|) also has a transparent interpretation revealed in 



|T8fl . From the gluon point of view the semileptonic decay of the b quark is an 
instantaneous replacement of b by c quark. In ordinary QM the overall probability 
of the produced state to hadronize to some final state is exactly unity, which is the 
first, leading term in the r.h.s. of ( |109|) . Why then are there any nonperturbative 
corrections in the sum rule? The answer is that the 'normalization' of the weak 
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current 07^756 is not exactly unity and depends, in particular, on the external gluon 
field. This appears as presence of local higher-dimension operators in the current. 
Indeed, expressing the QCD current in terms of the nonrelativistic fields used in 
QM one has 

' 'aiD)a k (aiD)) +o(^)\<p b . (117) 



4m c m6 / \m 3 

The weak current 0757^6, according to Eq. (|117 ) converts the initial wavefunction 



W h into 



' (aiD)ak{aiD) + ^ B . (118) 



Then it is easy to calculate the normalization of ^ : 



2 _ IhT, l|2 /4 /4~/4 ( 1 1 



_ JZL. _ ^ — ^ I — — 1 — 4- — = — ) - ... , II^JI 2 = 1 . 
3ml 4 ym 2 m 2 3m c mb J 

(119) 

The additional terms are just the nonperturbative correction in the right-hand side 
of the sum rule. 

The similar relation holds for the zero-recoil vector current C'job : 

c 7o & <- <p+ (l - I ( ^2 + \ ~ — J i m ^f + C (A) 1 ^ ( 12 °) 

and, therefore, the normalization of the resulting wavefunction in these transitions 
is 



d xeyoK x ) 



4 \ mf m c rrib I \ rrin 



(121) 

The first two l/m^ terms in the bracket in Eq. ( |117| ) or terms 1/m 2 , 1/wi 2 in 
Eq. (|120|) are the result of the Foldy-Wouthuysen transformation Eq. fllBD . Cor- 
respondingly, they were missed in the standard HQET analysis used, for example, 
in |L4]|. The dominant effect 1/m 2 was thus basically lost. It would be correctly 
reproduced in the approach of the Mainz group ||; however, the issue was not elab- 
orated in enough detail for practical applications. The above inconsistency was first 
noted by A. Le Yaouanc in 1994 ||16|| . It should be noted that the Neubert's current 
numbers for Sy m2 and Fjj* ascending to |49] and quoted in later reviews, and which 
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are routinely cited by experimentalists, were derived in ||9| using the same incorrect 



expansion. The mistake has not been corrected yet. It was the origin of the claim of 
Ref. |E5] of insensitivity of the analysis to the value of fj,^ which clearly contradicts 



Eq. (H). 

The QM interpretation of the sum rules explains why the perturbative factor 



£a0^) is the short-distance renormalization translating the full QCD axial current 
into the current in the effective theory. Often the //-independent T]a is called a 
short- distance factor. It is not quite correct, t)a does not depend on the choice of 
the separation scale \i determining which effects are considered as short- distance and 
which are still included into the low-scale physics. It would be possible only if the 
perturbative corrections at arbitrary scale vanish, which is not the case. In reality 
t)a is a mixture of short- distance and long-distance effects. Since, nevertheless, it 
is defined perturbatively, it suffers from the IR renormalon uncertainties similar to 
those discussed for the pole mass. They are too significant in r\ A - 

3.5 Summary on |V^,| 

I would like to give here a brief summary of the two methods of extracting |V^,|. 
The most precise at the moment is the value obtained from T S \(B): Eqs. ( |5"TD and 
Eq. fl5"2|) with the central experimental input values shown there lead to 

VJl* 0.0419. (l- 0.012 -y ). (122) 

The overall relative theoretical uncertainty in this result is 6th ^ 4%. With future 
refinements implementing already elaborated strategy, we can expect reducing it 
down to 2%. It is important that I am speaking here about the defensive theoretical 
accuracy, the error bars that are assumed to cover the whole interval where the 
exact number can be. I think that the theoretical predictions - to the extent they 
are not affected by independent experimental input subject to additional statistical 
uncertainties - should have such a status. The way when the previous predictions 
are praised for being "only 2.5a away" from a posteriori updated estimates, seems 
to be inappropriate evaluation of the theoretical analysis. Q To allow for such a 
confidence, one should not combine different theoretical uncertainties in quadrature 
as it is accustomed with statistical fluctuations, and I rather added them linearly. 

The B — > D*lv zero-recoil rate also provides a good accuracy. The exact exper- 
imental status of the measurements extrapolated to q = is not completely clear 
to me at the moment. I heard different opinions, and leave the final word for the 
experimental experts. Using the reported average value and the estimate Fp* ~ 0.9 
a somewhat lower value of \V c b\ ~ 0.038 seems to emerge. It is interesting that 
the central theoretical value, according to Eq. ( |115| ) exhibits the dependence on 



18 The idea that the theoretical predictions can be wrong - outside the stated intervals - in 
almost a half of cases sounds somewhat humiliating. 
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similar in magnitude but opposite in sign to Eq. ( |122 ): 



) 



(123) 



The theoretical uncertainty 5 t h here constitutes probably 5 t h ~ 6%, however this 
estimate relies on additional mild theoretical assumptions. It is not clear how at 
the moment it can be decreased. Speaking of feelings, personally I think that Fd* 
larger than 0.95 or below 0.85 would be unnatural. Even the end points of this 
interval look extreme. However, the existing analyses do not allow to forbid them 
confidently, and the feeling itself may be merely a result of a historically formed 
common belief rather than based on scientific facts. 

It is remarkable that the values of \V c b\ that emerged from exploiting two theo- 
retically complementary approaches are very close. The progress was not for free: it 
became possible only due to essential refinements of the theoretical tools in the last 
several years, which prompted us, in particular, that the zero-recoil B — > D* form- 
factor Fd* is probably close to 0.9, significantly lower than previous expectations. 
The decrease in Fd* and more accurate experimental data which became available 
shortly after, reduced the gap between the exclusive and inclusive determinations of 



A few years ago such an accuracy and agreement would be considered as an 
important success. Nowadays we are more demanding to the heavy quark theory. 
The difference in the two values of \V c b\ from a certain perspective may look as a 
hint for the discrepancy: the central value of \V c b\ from B — ► D* decay seems to be 
somewhat lower than that from T S \(B). Since both theoretical values depend to a 
certain extent on the precise magnitude of /i 2 , it is tempting to think that the actual 
value of /i 2 is somewhat larger than the "canonical" 0.5 GeV 2 . Increasing it by about 
0.2 GeV 2 makes the two results much closer. This would not really contradict known 
facts. I hasten to add, though, that the existing experimental error bars are such that 
speculations about adjusting /i 2 are premature. Moreover, theoretical uncertainties 
in the exclusive formfactor also preclude us from the above adjustment of /i 2 (Fd* 
can well be, say, 0.87 even at the canonical value /z 2 = 0.5 GeV 2 ). The ways to 
sharpen the theoretical predictions of T s \ are more or less clear. Future accurate 
measurements will, hopefully, allow one to directly measure - through comparison 
with r s i(S) - the exclusive formfactor with accuracy better than that achieved by 
today's theory. Thus, we will get a new source of information on intricacies of the 
strong dynamics in a so far rather poorly known regime. 

Changing the parameters like /x 2 per se within reasonable intervals is not tightly 
limited at the moment. However, it may well imply other related changes - the size 
of l/m c corrections for formfactors at v ^ 0, delaying onset of duality with smaller 
values of [1q(0.8 GeV), etc. Not all these links are properly taken into account 
account in the analyses. The comprehensive study is still to be done. One element 
is already clear. In most of the existing theoretical analyses it is essentially assumed 



v cb \. 
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that the approximate duality between the actual hadronic amplitudes and the quark- 
gluon ones sets in already at the excitation energies ~ 0.7 to 1 GeV. While there 
are no experimental indications so far that this is not the case (at least, in the 
semileptonic physics), the proof is not known either. If that is not true, and duality 
starts only above 1 GeV, most probably one would have to abandon the idea of 
accurate determination of \V c b\ from the exclusive B — > £)W transitions. The only 
option still open will be the inclusive semileptonic decays where the energy release is 
large, ~ 3.5 GeV. Of course, in such a pessimistic scenario the theoretical precision 
in \V c b\ will hardly be better than 5%. 



3.5.1 Comments on the literature 

Since the question of the accurate determination of V c b is very topical, there is a 
wide spectrum of opinions here. Speaking to the experimental colleagues, it would 
not be fair to avoid mentioning alternative suggestions. Over the last few years with 
the developing of the genuinely QCD-based methods, general convergence of central 
values in the estimates is clearly observed. The controversy mainly shifted to the 
complicated issue of estimating theoretical uncertainties. 

Those who are familiar with ongoing debates can notice that my assignments for 
the theoretical uncertainties in T s \ are on the lower side of the spectrum. For the 
B — ► D*lv zero recoil rate they are closer to common estimates (with the exception 
of Neubert who pretends to a significantly more accurate prediction). The smaller 
uncertainties in T s \ are anticipated a priori. First, T s \ is a genuinely short- distance 
quantity while Fd* is not, starting l/m 2 c effects. Second, the normalization of Fp* 
rests on the heavy quark symmetry and thus suffers from downgrading the expansion 
parameter: it is l/m c rather than l/m^. The task of the QCD-based HQE was to 
make full use of these advantages. 

Let me first dwell on B — > D*iv. In general, the statements of a good accuracy 



here have been merely transferred from the old reviews |8|, |94| claiming three times 
smaller corrections than today and, correspondingly, small error bars. While the 
central value quoted by the author went, in a few iterations, down to the current 
prediction for F D *, a similar due revision did not affect the suggested error bars. A 
closer look shows that they are not better motivated than before. 

The central value quoted by Neubert now does not differ at a discussible level 
from the original estimates of the heavy flavor sum rules PTJ . The analysis is based 



anyway on the very same relations and assumptions. It is stated, in addition, that 



the paper |49| where the sum rule analysis was reproduced, has been complimented 
by some symmetry relations; this was called later the "hybrid" approach by the au- 
thor. In fact, the sum rules manifestly respect the heavy quark symmetries, and the 



latter cannot provide any further constraint. Careful studying of the analysis of 



may seem complicated due to presence of numerous parameters and notations as well 
as contours on plots called upon to make numerical conclusions looking convincingly. 
It can be done nevertheless, and shows that the analysis of the 1 / m 2 terms merely 
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was not correct. The flaws were briefly discussed in j|T] and were conceptually 
related to missing the contributions from the Foldy-Wouthuysen transformation. 
The basic points which led Neubert to the small uncertainties were the following: 

• discarding \jw? c and higher-order corrections altogether 

• ad hoc fixing the value /i 2 = 0.4 GeV 2 in the analysis 

• the claim of insensitivity of the result to /z 2 related to missing the term (/x 2 — 

Curing these flaws returns one to the estimates given in Sect. 3.4. 

Another flaw of the estimates by Neubert is improper account for the radiative 
corrections. The formfactor Fp* was judiciously split into two parts. The first was 
"purely perturbative" factor t]a, and all the rest was just called the nonperturbative 
corrections: 

tf/m' + *l/m» + - = F D * ~ VA ; (124) 

t]a was defined as a sum of the formal perturbative series calculated without any 
infrared cutoff. Such a quantity cannot, however, be consistently defined even if 
our knowledge of the perturbative coefficients were practically unlimited. The as- 
sociated numerical uncertainties are quite significant. Even adopting an optimistic 
convention for the uncertainty with the ad hoc factor 1/tt in front of the imaginary 
part of the Borel-resummed series (like in Eq. (|6"5p), one gets for the and 
I/ttiq irreducible uncertainties 

,r ,ie- (AlD) 2 (_i + _i) 2 , 0.022(A) 2 (125) 



5/2 ,.™ xs/ll 5 5 11\ _ / AgL V 



s i/ m » „ ^_ ( a ms y ii , , , ±L ~ 014 QCD 

dm - 72 [ a qcb) ^3 + m 2 mb + mcm 2 + ml) ~ U ' Ui4 ^250 MeV ) ' 

(126) 

respectively (higher-order IR renormalons are smaller). Thus, the uncertainties 
in both "perturbative" and "nonperturbative" corrections defined in such a way 
somehow appeared smaller than their irreducible intrinsic uncertainty. 

To avoid confusion I must note that Eq. ( |125|) was obtained in [ rP[ . This, how- 



ever, contradicted the claimed high accuracy of calculating 7]a p2| . Hence, already 
in the next papers the numerical estimate for this uncertainty was skillfully reduced 
by a significant factor. It was achieved by simple two-step manipulations: the value 
of (5j^ m was first expressed via the square of the relative IR uncertainty in the mass, 
Eq. d65|) . Then a rather low numerical uncertainty in the mass was used. In this way 
^ir™ seemed to emerge only 0.006, to the author's satisfaction |49], |95|]. Using the 



direct formulae, however, one gets about 4 times larger uncertainty. For this reason 



19 The insensitivity was illustrated by varying in a very narrow interval from 0.36 GeV 2 to 
0.5 GeV 2 . This allowed to bury the impact of changing /1 2 under uncertainties in other model 
parameters. 



66 



the two expressions for Sl^ 71 in t\a and 5ir in mg le were never quoted together, and 

the uncertainty 5^ was always numerically evaluated by Neubert in this "indirect 
way" via more or less arbitrarily defined 5m^ Xe rather than using the known explicit 
expression. 



The practical inconsistency of the approach adopted in |49| is well known. Phys- 
ically it leads to double counting of the large- distance domain - once as a part of the 
perturbative effects, then in the nonperturbative corrections. Numerically it is quite 
significant in the case of Fp*. For example, Ref. pointed out that the bounds 
used by Neubert for Fd* — tja are too strongly offset by the perturbative corrections. 
In fact, the theoretical inconsistency of such an approach were even noted by the 



author himself in [ |5| , where the results were stated conditionally valid provided the 



corrections would not alter them significantly (according to |96 |, they just do). This 



qualification has never appeared later in the numerous author's papers quoting |f49| . 

In any case, it is easy to see that the bounds of the type used in |49| are justified 
only if the perturbative factor £Y 2 (/u) is used instead of tja, see Eqs. ( |109| , |1 1 1| ) . 



This short-distance factor is well calculated now ||87| , and is significantly larger than 
the value used by Neubert. It is not clear whether the author plans to revise those 
inconsistent estimates, and what will be the central value of the updated numbers. 
Taking everything at face value, the central value for Fp*, according to Neubert, 
must have been close to 0.94. 

Now, let us turn to T s \. In spite of the accurate experimental data and a well- 
developed theoretical description, determination of V c b in this way often unjustifiably 
discarded being ascribed inflated uncertainties. As I mentioned before, the origin 
was the misconcept of the pole mass flourishing in HQET a few years ago. In par- 
ticular, it was tacitly assumed that the quark pole mass has an unambiguous value. 
Then it was observed that: 

(a) It is difficult to accurately extract ml° le from experiment. In any particular cal- 
culation one can identify effects left out, which can change its value by ~ 200 MeV. 
This uncertainty leads to a theoretical error 5i ~ 10% in T S \(B). 

(b) When routinely calculating T s i(i?) in terms of the pole masses, there are signif- 
icant higher order corrections 5u ~ 10%. 

Thus the conclusion was made: T S \(B) cannot be calculated with accuracy better 



than 5i + 5n ~ 20%, and, correspondingly, at best <5| V^l/I ~ 10% [[49 . 

Both observations (a) and (b) above are correct, beyond any doubt. If one more 
step is taken, however, the conclusion is invalidated - one should take into account 
the fact that the origin of these two uncertainties is actually the same, and therefore 
they practically cancel each other. 

Since the nature of the heavy quark in the QCD-based HQE was clarified in 
the numerical relevance of these theoretical facts was illustrated in a number of 



publications |4^, [46| |97|, [98|, p2| . Nowadays only the above two problems allegedly 
impair the precise calculation of T s i. the danger of large perturbative corrections 
and the uncertainty in m?,. The anatomy of large corrections and their cancellation 
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was described recently, for example, in the review [[ITJ. Here I would like to give a 
complimentary illustration. 

The HQE in QCD expresses the width in terms of the short-distance mass m^di) 
with fi ~ 1 GeV. Such a mass is well defined and has a certain value. We do 
not know it precisely, of course. The HQE assumes, nevertheless, that it can be 
extracted from experiment with theoretically unlimited accuracy. Let us adopt 
here the determination from the moments of e + e" — > bb cross section discussed in 
Sect. 3.2.3. Thus, we calculate the semileptonic width directly via V c b and <jg+ e _(s). 
Reference to the quark mass is made only for computational purposes. Let us also 
neglect the small nonperturbative corrections at this point and fix the value of 
rrib — m c . Both T s \ as a function of m^fi) and the mass rrib(fi) itself as expressed via 
<7g+ e _ (s) undergo perturbative corrections. How important is their overall impact? 

To see this, we can first neglect the perturbative corrections altogether. It is 
important that it must be done at both stages of extracting V c b simultaneously: it is 
not legitimate to keep 0(a s ) correction in one part and discard it in another part of 
a single relation. If we neglect them consistently, we would get a certain value which 
we can denote | V^ 1 | . It is somewhat smaller than quoted in Eq. fl5T|) . Already here 
the two competing effects are apparent: we would use a smaller nib — 4.60 GeV but, 
at the same time, discard negative perturbative corrections to the width. 

Now we include 0(a s ) corrections. Again, we must use the same (not running 
in the first order) value of a s at both stages. We get a new value, |V^|. It appears 
that 

iVi'W^l * 1-06- (127) 

The effect of the leading perturbative corrections changes \V c b\ at a 5% level only! 
This numerical result is easy to interprete also in terms of the usual calculations ex- 
pressed via the pole masses: the first-order radiative effects shift m^ ole from 4.60 GeV 
up to 4.82 GeV, but bring in the perturbative suppression factor about 0.8. 

One can incorporate the BLM improvement, that is, account for running of a s , 
again in strictly the same way in T sl and cjg+ e _(s). The impact of, e.g., the second- 
order corrections changes |T4&| by less than a percent PSJ. And this magnitude of 



the 0(a 2 s ) effects is quite expected if the 0(a s ) correction was only a few percent. 
The all-order BLM resummation does not change this conclusion. 

The complete 0(a 2 s ) analysis of extracting mb([i) from <7g+ e _(s) has not been 
done yet, and we cannot literally extend this comparison to the non-BLM effects at 
order a 2 s . Nevertheless, it is difficult to imagine how a significant effect exceeding a 
percent level can emerge from it. 

The usually cited 'naive' evaluation of the magnitude of the perturbative correc- 
tions in the width differs from the above in one but quite important aspect. It forgets 
that the pole mass nib routinely used there has to be perturbatively calculated anew 
in every order, with the numerical effect similar to Eq. flSop. Instead it is treated as 
a fixed value. It is reasonable to ask - which exactly? The answer is not clear, since 
only the "correction factor" is quoted. And the corrections look significant indeed. 
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It only remains unclear to a reader how with these huge higher-order correction the 
final number quoted for \V c t,\ differs from what had been obtained without any re- 
finements [2TJ by less than 1%, and even neglecting the QCD corrections altogether 



yields a 5% accuracy. The reason is explained above. 

3.5.2 Analyticity and unitarity constraints on the formfactor 

Extracting \V c b\ from the exclusive decays implies extrapolating data to zero recoil. 
Near zero recoil statistics in the decays B — > D*lv is very limited, and the result 
for the differential decay rate at this point is sensitive to the way one extrapolates 
the experimental data to q = 0. Most simply this is done through linear extrapola- 
tion. Noticeable curvature of the formfactor would change the experimental value 
for \V c bFr>*(0)\. Under the circumstances it is natural to try to get independent 
theoretical information on the g 2 -behavior of the amplitude. 

Some time ago it was emphasized |99[ that additional constraints on the q 2 - 
behavior follow from analytic properties of the B — > D* formfactor considered as a 
function of the momentum transfer q 2 , combined with certain unitarity bounds. In 
particular, in the dispersion integral 

, 9 . 1 r , ImF(s) 

F(q 2 = - / ds Y 128 

7i J s — q l 

the contribution of the physical s-channel domain q 2 > (Mb + M D (*)) 2 is bound 
since \F\ 2 describes the exclusive production of BD and cannot exceed the total be 
cross section. The integral (|128|) receives important contribution from the domain 



below the open be threshold, where F(q 2 ) has narrow pole-like singularities corre- 
sponding to a few lowest be bound states. Introducing a set of unknown residues and 
making plausible estimates of the positions of these bound states, on the one hand, 
and adding a small non-resonant subthreshold contribution, on the other hand, 



Refs. [|100| suggested an ansatz for the formfactor in the whole decay domain. 
Since the residues are unknown and the momentum transfer in the actual decays 
B — > D^iv varies in a rather narrow range, the advantages of this parameterization 
over the standard polynomial fit are not clear at the moment. 

A much more stringent relation between the slope and the curvature of the form- 



factor (i.e., F'(q ) and F"(q ) at zero recoil) was claimed in |]101[ . Unfortunately, 
this relation was hastily incorporated in some experimental analyses. It was shown 
in |98| , |4l|j that the analysis of Ref. [|101|| was erroneous, and correcting the flaws kills 
the predictivity of the relations discussed there. This conclusion was later confirmed 



in 



A different few-parameter ansatz for the shape of the effective B — ► D* formfac- 
tor was also suggested in ||102|| . Although it seems to be one of reasonable possible 



ansatse, its model-independence was exaggerated. The authors relied too heavily 
on the approximate heavy quark symmetry in the threshold region. The indirect 
form in which these constraints were imposed hid this fact. Additionally, the shape 
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is sensitive to the exact structure of all the subthreshold be states, which were taken 
from potential models whose reliability for the higher excited states is suspicious. 
A detailed consideration reveals that one should allow for larger corrections to the 



parameterization suggested in [102 



3.5.3 On the 1/jtiq corrections to the widths and duality violation 

The question of the asymptotic expansion of the inclusive widths at the nonper- 
turbative level was not too obvious, as illustrated in Sect. 3.1. Even though the 
HQE answered it, this question is still occasionally raised in the literature. The sug- 
gestions of possible 1/rriQ nonperturbative effects in different form still flash every 
now and then || ^T], |29fl . The experimental evidences that BR s i(S) and the 

lifetime T\ b seem to be lower than suggested by theory additionally provoke such 
speculations. 

There are a few reasons behind such a doubt. One of them lies in purely technical 
aspects. The traditional approach of HQET to the semileptonic widths ||104j| is to 



calculate only the differential semileptonic width r s i(g 2 ) where the lepton invariant 
mass q 2 is fixed. The total width is finally obtained integrating r s i(g 2 ): 

(M B ~M D f 

dq 2 r sl (g 2 ) (129) 

(the case of b — > u is simply obtained by setting M D — m c — 0). At not too 
large q 2 this differential width indeed can be easily calculated including the leading 
nonperturbative corrections. For example, up to 1/m 2 effects it is given by 



G 2 F \V cb \ 2 \ {m\ - m 2 ) 2 + q 2 {m\ + m 2 ) - 2q 



96vr 3 



mi 



<o\- Aqcd m 1 ) nm 

K m b - m c - y/q 2 ) t 

However, at q 2 approaching {m b — m c ) 2 the 1/toq expansion breaks down and 
the differential width cannot be evaluated. In this soft domain no literal 1/mg 
expansion is available. For example, with both b and c quarks heavy enough, at 
maximal q 2 the actual hadronic width differs from the quark width by the universal 
factor _ 

r h ad(<? 2 ) _ 1 + 3 A(m b -m c ) 



r qua rk(<? 2 ) 2 m b m c 

which explicitly includes 1 / m c correction. Although decreasing q 2 somewhat dimin- 
ishes the ratio, even after the D** threshold the deviation is still of a similar order, 
i.e. scales like 1/mg. 

At the same time, the overall contribution to the semileptonic width from this 
domain of large q 2 is a priori at the relative level of AqcD/( m -fe ~ ^c), so it has to be 
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calculated to address even 1/mg effects. Moreover, the upper limit of integration 
over q 2 is clearly dictated by the hadron rather than quark masses; this is particularly 
manifest for b — > u where Mjj — > m n ~ 0. Naively one could have expected the 
width to depend, at least partially, on the hadron rather than quark masses. 

This problem of the HQET approach underlies the reason why, for example, 
the paper ||104|| allowed for the possibility to have 1/rriQ nonperturbative effects 



even in the semileptonic widths. (For the nonleptonic widths such an approach 
is not applicable at all.) In this language the QCD statement of vanishing such 
nonperturbative effects is quite nontrivial: while the width T s \(q 2 ) cannot be literally 
calculated in the above domain, the integral over q 2 

dg 2 r sl (g 2 ) (132) 

min 

can be, provided mj — m c — J q^ n ^> Aqcd- This fact automatically emerges in the 
approach suggested in |25|, |26|, |3£|. To get it in the HQET-type approach required 
usage of the sum rules for the heavy quark transitions |18| , and was demonstrated 
recently in |32| . 



It is worth emphasizing that the QCD-specific sum rules guarantee that the 
leading nonperturbative effect affecting the phase space in the semileptonic decay 
cancels against the corrections to the hadronic formfactors, the fact which is difficult 



to ensure in ad hoc quark models. This cancellation was apparently missed in p0 | . 
As a matter of fact, the cancellation persists even in the perturbative effects up to 
momenta ~ m^/b. 

The paper || claimed that there is an independent d = 4 operator in HQET 
capable to affect the semileptonic widths producing l/m^ nonperturbative correc- 
tions. Such a statement is unacceptable. I think that the confusion was merely a 
matter of ill-formulated expansion procedure. 

A similar deficiency affects in my opinion the analysis ||103|| . The calculation of 
the width included a set of the nonperturbative parameters among which there was A 
governing l/m,b corrections explicitly introduced in the analyses. The inconsistency 
was again rooted to relying on the hadron masses. The QCD states, on the contrary, 
that for the B meson width r s i(5) the B meson mass Mb is no more relevant than 
the mass of At,, or Mb s , or any their combination! 

I cannot avoid mentioning that lobbying a possibility of 1/rriQ corrections got 
recently new impetus. Resurrecting the previous misconceptions Grinstein et al. 
|3~T| claimed establishing the OPE- violating 1/jtiq terms in the inclusive widths on 
the example of the 't Hooft model, the large-A^, two-dimensional QCD. I cannot 
dwell on details here, but can assure the reader that these results are wrong. 

To summarize, there must be no reason so far for admitting a possibility of I/tuq 
corrections in the inclusive widths, even though it may not be obvious in certain 
approaches. 

A more subtle problem is violations of local duality which can affect the widths, 
e.g. semileptonic, at a certain level. Reliable information on this aspect of QCD 
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is scarce. The only model specifically designed to address the issue is presented in 
Ref . |33|] . It predicts negligible deviations from duality in the semileptonic decays of 



B mesons. Irrespectively of the model, they must be small. In this respect I would 



disagree with Bj who tried to argue in favor of naturalness of significant effects [105 



The arguments examined in more detail rather suggest a small scale for the effects. 
Of course, it cannot be interpreted as a rigorous proof. 

The violation of duality must fast decrease with energy. It is worth to recall 
that in a similar four-fermion hadronic decay width of r lepton one only has E = 
m T ~ 1.78 GeV. Nevertheless, it is often believed that the duality holds there at the 
level of 1 or 2%. (More realistically, the scale of the violation is around 5 to 7%.0) 
In b — > civ the energy release is 3.5 GeV. Moreover, there must be an additional 
suppression which can be revealed by varying m c as a free parameter. 

In b — > civ the energy release is the smallest when m c increases and approaches 
mfe. In this limit (it is just the SV limit) there is an exact duality at least up to 1 jm\ 
terms, the fact emphasized first by Shifman and Voloshin in the mid 80's in one of 
the first papers [^] established the base of the HQE. The effect can be evaluated 
similar to F D w and is below a percent level. 

In another extreme limit m c — > the heavy quark symmetry does not apply, 
however the energy release approaches 5 GeV. At such a scale it also must be 
negligible. It is hardly possible that the duality violation can have a strong peak 
at an intermediate m c . Comparing to r leptons, we thus have an essentially larger 
energy parameter and an important suppression of the duality- violating effects by 
the heavy quark symmetry. It is hardly reasonable in this situation to allow a much 
stronger effect compared to 'unprotected' hadronic r decays. 

It is interesting to discuss another argument by Bj who notes that in e + e~ an- 
nihilation one still has a significant resonance structure up to E = ^ « 2.5 GeV. 
In B decays the energy is safely above this domain, but in general one has to be 
careful relying on this experience. For the kinematics of hadronic transitions are 
quite different. In e + e~ we have a point-like production of the initial quarks while 
in B decay the final state quarks start evolving at large separation ~ Aqcd after 
the decay. This 'multiperipheral' nature in the heavy quark decays determines the 
difference in the nonperturbative effects which start with I/ttlq vs. l/E 4 in the 
annihilation. 

This peculiarity is reflected in the duality-violating effects as well, in particular 
in r decays. Considering only the vector channel decay rate one experimentally 
observes larger, sizeable violations of duality. They are strongly washed out as soon 
as the axial channel is added when more low-energy resonances contribute. In this 
respect r s i(i?) has a clear advantage: a variety of final states with different angular 
momenta is much more rich here even without producing extra quark pairs. 

The scale of onset of duality is typically governed by the masses of the first 
prominent resonances. In r decays they are M p and m n , M ai for V and A, respec- 



20 Since the effects of duality violation must oscillate as a function of m T 1 33 
they nevertheless may vanish. 



at a certain mass 
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tively. Such scale is lower in the semileptonic decays. For example, the P wave 
excitations in b — > c have the mass gap ^ 500 MeV. It is directly related to the 
richer quantum number structure of the decay. Even radiatively excited states are 
expected below Md + 1 GeV. 

As a result, a closer look at the potential problems with duality rather suggests 
that it should work better in Y S \(B) than in T^^t). I think that a more dedicated 
analysis is to be done to reveal reasons - if any - why the former may be more 
vulnerable. So far all arguments suggest that T s i(i?) is safer. 



3.6 Lepton Spectra and Decay Distributions. 
Model-insensitive Determination of \V u b\ 

The inclusive differential distributions in semileptonic (or similar radiatively-induced) 
decays are treated in the HQE similarly to the inclusive widths |39], |106|| . An 
important fact pointed out in is that the QCD-based OPE automatically leads 
to an analogue of nonrelativistic motion of the heavy quark inside the hadron. This 
"Fermi motion" was phenomenologically introduced long ago, first in ||107| and then 



elaborated further to the status of a well-formulated model ||108|| . A similar phe- 
nomenological approach was later used in a number of papers ||109| . A detailed 
description of the Fermi motion in the framework of the 1/itiq expansion was later 
given in [||, [Il0| and in [pTT[] . 



The "Fermi motion" in QCD has certain peculiarities which are absent in the 
phenomenological models [19|, |1 1 2|| . The distribution over the 'primordial' Fermi 
momentum F(p) is replaced by a certain distribution function F(x), where x < 1 
measures the momentum of the b quark in the units of Mb — m^. First distinction 
is that F(x) is one-dimensional; one can define only the distribution over a certain 
projection of the momentum. 

Second, F(x) depends essentially on the final state quark mass (more exactly, on 
its velocity). While it is the same for b — *> uiv and b — > s + 7 where the final quark 
is ultrarelativistic, it is rather different for c quark in b — * civ where it is closer to 
a nonrelativistic particle. 

Finally, F(x) is actually normalization-point dependent. 

Although F(x) depends on the final state quark mass, there are still certain rela- 
tions between the moments of F(x) at arbitrary mass. The moments are expressed 
via the expectation values of the local heavy quark operators; these relations are 



examples of the generic sum rules for the decays of heavy flavors (18| . In particular, 
the 'center of gravity' of the distribution over the primordial momentum points at 
the heavy quark mass mi,. The second moment of F(x), its dispersion, is propor- 
tional to the kinetic expectation value The third moment is proportional to the 
Darwin term, etc. Let me mention one fact ||112| , |110|| which is often missed: the 



AC 2 M 2 model ||108|| does correspond to a certain QCD distribution function F(x) 



the so-called 'Roman' function which however is not Gaussian. Moreover, the value 
of in this model is not given by pf itself but is a more complicated function of 
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Pf and m sp , the two parameters of the AC 2 M 2 model. 

It appears that the effects of Fermi motion are very important for a possible 
determination of V u b- We know certainly that \V u h\ 7^ holds since (i) the decays 
B — > niu, plv have been identified and (ii) B — > X iv has been observed with 
lepton energies Eg that are accessible only if X does not contain a charm hadron: 
Ei > (Mb — Md)/2Mb = 2.31 GeV. Translating these findings into reliable numbers 
for \Vub\ is a much more difficult task theoretically: On the one hand the exclusive 
decays B — > ixtv, ptv depend on bound-state effects in an essential way. On the 
other hand in analyzing the endpoint spectrum for charged leptons in the inclusive 
decays B — > X Iv one encounters different sorts of systematic problems. Only 
a fairly small fraction of the charmless semileptonic decays B — > X u £i>, namely 
around 10% or so, produce a charged lepton with an energy beyond that possible for 
B — > X c tv. The data suggest that |V^|/|Vd,| ~ 0.1, however the actual estimates 
vary within a factor of two. In addition, the b — > c rate is so much bigger than 
that for b —>■ u that leakage from it due to measurement errors becomes a serious 
background problem; furthermore the endpoint region is particularly sensitive to the 
nonperturbative dynamics. 

Since the inclusive semileptonic width T s \(b — ► ulv) is reliably calculated theo- 
retically, Eq. Q53D, the best way to determine \V U \\ would be to accurately determine 
BR s i(6 — > X u iv) where X u denotes the charmless hadronic states. It is not yet com- 
pletely clear how reliably this fraction can be measured in experiment. Of course, 
the problem is not statistics per se. 

The most direct way to disentangle b — > utv from b — > civ without tagging the 
secondary charm decay would be to study the invariant mass of hadrons in the final 
state Mx- 




(133) 



For the free quark decay one has M x ~ in b — ► u and M\ = m 2 c for the b — > c 
transitions. In actual decays the mass can take values exceeding m w and Md, respec- 
tively. The increase in mass can originate both in the perturbative processes when 
a hard gluon is emitted in the decay, or through soft bound-state or hadronization 
processes. The leading soft effects emerge due to same physics which gives rise to 
the Fermi motion. They are quite significant. For example, the average invariant 
mass square of hadrons in b — > u gets the nonperturbative correction ~ Aqcd • m b 
| TT3| : 

(M 2 x ) nonpcit = ^m b (M B -m b ) + O (A 2 QCD ) ~ 1.5 GeV 2 (134) 

(the suggestion of ||104|| about the absence of these relative l/mq corrections to M\ 
was erroneous). The perturbative corrections lead to (M x ) pert ~ ^m 2 , however in 
the B decays this increase is still smaller than the nonperturbative corrections. 

The above estimates already show that the broadening of the Mx-distribution 
is quite essential; whether its impact is insignificant or dramatic for discriminating 
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the two semileptonic decay channels, was not clear a priori and required numerical 
analysis. It was done in ||1 14 1 . The charm quark mass happens to be just near 



the borderline separating the domains of 'heavy' quark m c ^> wAQCD^b where 



the effect of broadening is small, and of 'light' quark m c <C yAQ CD m b when the 
kinematic difference between the inclusive b — > u and b — ► c decays is buried under 
the hadronization effects. 

To quantify the effect of the strong interactions let us introduce, following 
Refs. [|114| , |1 1 5|| the fraction of the b — > u events with Mx below a certain cut- 
off mass M max : 

1 rM msx AT 

*< M »«) = r(6^)/„ iMx lM- x - < 135 > 

Clearly, $(0) = and $(M^) = 1 independently of strong interactions. The main 
question to theory is whether it can calculate accurately enough $(M max ) with 
M max £ 1.5 GeV. 

The dedicated analysis was carried out in ||1 14| , |115|| , and the conclusion appeared 
to be quite optimistic - the strong interaction effects in b — > utv are not expected 
to populate the domain above Mx = 1.5 GeV too significantly. The theoretical 
predictions for the spectrum can be found in the above papers. Later a similar 
analysis was undertaken in [|116| . 



As anticipated, the mass spectrum is broad and extends even beyond Mb - yet 
only a small fraction does so, namely ~ 10%. Due to measurement errors there will 
be a tail from b — > c transitions below Mp- To avoid this leakage one can concentrate 
on recoil masses below a certain value M max . The actual choice of M max is driven by 
competing considerations: the lower M max , the less leakage from b — > c will occur 
- yet the smaller the relevant statistics. Even more important, the accuracy of the 
theoretical predictions deteriorates at lower cutoff mass. The predictions and their 
major uncertainties are illustrated in Figs. 5 taken from [ 115| . 

This analysis showed that it seems important to reach a value of M x ~ 1.5 GeV 
in the reliable measurement of the fraction <3>(M max ). If the necessary discrimination 
of b — > c decays can be achieved at such masses, a rather precise model-insensitive 
determination of \V u b\ and/or |14b|/|K:&| will be possible. Those who are interested 



in further details can find them in Ref. |115| . 

I have to note that the conclusions of paper ||116|| were less definite, and its pre- 
dictions suffered from larger uncertainties. The treatment of both the perturbative 
and nonperturbative corrections was somewhat different; I consider some elements 
of the analysis not quite consistent. 

Sharpening the theoretical predictions for $(M max ) would require, in particular, 
a more refined treatment of the perturbative corrections. In the b — > u Iv or b — > 
s + 7 distributions they are not very simple since include strong bremsstrahlung 
corrections that must be dealt with carefully. This will not be a stumbling block, 
however. A certain approximation, the "APS" (Advanced Perturbative Spectrum) 
was elaborated in ||110|| which is feasible and, at the same time, must yield more than 
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Figure 5: The integrated fraction of the b — ► ulv events $(M X ). (a): Dependence on m^. 
Long-dashed, solid, and short-dashed lines are mb(lGeV) = 4.58 GeV, 4.63 GeV and 4.68 GeV. 
(b): Dependence on Long-dashed, solid, and short-dashed lines correspond to /u 2 — 0.3 GeV 2 , 
0.5 GeV 2 and 0.7 GeV 2 . 



enough precision in accounting for the perturbative corrections to the distributions 
even in these transitions. If future studies demonstrate the possibility of accurate 
measurements of the hadronic invariant mass in the semileptonic decays and the 
anticipated accuracy becomes clear, such refined calculations will be done. 

Concluding this discussion, I want to remind that it is very desirable to study 
such distributions separately for B ± and B° mesons. The same refers also to studying 
the end-point region of the lepton spectra. In this way it will be possible to directly 
study and detect the so-called weak annihilation, the nonperturbative effect sensitive 
to the flavor of the spectator quark |113| . Weak annihilation is a relatively small 



1/mf effect in the total widths. However, in the semileptonic transitions it mainly 
originates from the end-point domain, Eg Mb/2 and small Mx- Thus, in these 
studies its relative effect is enhanced. It would have an independent theoretical 
interest as well providing valuable information for the HQE about the expectation 
values of the four-fermion operators, in particular, their nonfactorizable pieces |113|. 



4 Challenges in the HQE 

Concluding my lectures, I cannot avoid mentioning places where the HQE seems to 
have problems when compared to experiment. It is probably premature to speak 
of a direct contradiction; nevertheless, today's question marks carry the seeds of 
tomorrow's advances. Basically there are two problems where our theoretical un- 
derstanding is lagging behind. Both are related to nonleptonic decays. 

4.1 Semileptonic Branching Ratio of the B Mesons and n c 

The theoretical attitude to this problem changes with time. Twenty years ago the 
parton model gave a prediction BR s i(£>) ~ 13 -15%, which was accurate enough 
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according to the existed standards. The variation reflected mostly the choice of 
quark masses. While the rates of c — > civ and b — > cud were affected by the choice 
of m c in a more or less same way, the rate of the b — > ccs channel decreased much 
faster when larger masses (closer to the 'constituent' rather than short- distance 
'current' ones) were used. Therefore, BR s i(I?) increases for larger masses. It should 
be remembered that the difference of m;, and m c is fixed, so the choices of m c and 
rrib are always correlated. Even though the heavy quark symmetry had not been 



formulated, the latter fact was clearly realized at least in the early 80's [ 117 . 

The experimental situation was not very definite and indicated a rather large 
BR s i(5) which reasonably fitted the 'larger' mass option ||118|| . Since then it be- 



came more or less standard to use the larger quark masses, and the parton model 
prediction for BR s i(£>) was accepted to be 13-14%, even though a smaller value 
could be obtained as well. Since the impact of the nonperturbative corrections was 
completely unknown and presumably larger, it was not seriously discussed even 
when the better data became available. 

The situation changed when in 1992 Bigi et al. showed that there are no 1/toq 
nonperturbative corrections to the inclusive widths, both semileptonic and nonlep- 
tonic. Moreover, even the smaller 1/t?t| nonperturbative effects were readily cal- 
culated |25|, [26], |119| |. They appeared to be suppressed, in particular, as a result 



of certain cancellations. The overall effect A nonpcrt BR s i(.B) was found to be about 
—0.5%; the nonperturbative effects could not be blamed for a discrepancy any more 

Pi- 



It prompted a more careful analysis of the perturbative corrections to the widths. 
In particular, the 0(a s ) corrections to the nonleptonic b — > cud width were calcu- 
lated accounting for the non-zero m c [ 121| ; this additionally enhanced the nonlep- 



tonic width. Later the account for the charm mass was also completed for b — > ccs 



122| , |123| , |124|| . Altogether, these O(o. s ) corrections further decreased BR s i(£>) down 
to 11-12%. Although this shift naively seems very significant and may raise con- 
cerns about the convergence of the perturbative corrections, it actually is not dra- 
matic if one starts with more appropriate short-distance masses, the choice forgotten 
for historical rather than rational reasons. It is important to keep in mind that the 
relative increase in the nonleptonic width was particularly significant in the b — > ccs 
channel. 

The issue of the semileptonic branching ratio must be considered in conjunction 
with the charm yield n c , the number of charm states emerging from B decays. To 
measure n c one assigns charm multiplicity one to D, D s , A c and S c and two to 
charmonia. Zero is assigned to the charmless hadronic final state. It is obvious that 

n c ~ 1 + BR(B -> ccs q) - BR(5 -> no charm) . (136) 



Such a joint analysis was motivated already in p5[ : the energy release in b — > ccs 



is rather small, and this can lead to significant duality-violating and higher-order 
effects. The stability of the perturbative expansion also downgrades. Measuring n c 
allows one to effectively exclude this channel from the theoretical calculations. 
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The data on n c were not quite consistent for some time, but finally seem to 
converge to - unfortunately - a smaller, more difficult for theory value. If we take 
experiment and theory at face value, the allowed intervals do not overlap. The 
semileptonic fraction seems too low, about 10.6%, which is possible to accommo- 
date only at n c > 1.25. Such a large number is not compatible with the CLEO 
measurements. It is not clear how it must be interpreted. At this School I heard 
suggestions from the experimental colleagues whose opinion I highly respect, that 
one may not take too closely the discrepancy within two, or even within three quoted 
error bars. Taking such an optimistic (for a theorist) attitude, one could have been 
relaxed. I cannot, however make such a decision myself and have to discuss numbers 
as they are officially quoted. 

In my opinion, it is difficult to accommodate theoretically the value of BR s i(i?) 
below 11.2% if n c does not exceed 1.20. Those who say that it is easy to obtain it 
in the SM, probably stretch uncertainties too liberally. Of course, even if eventually 
experiment arrives at the current central values, we will not have to revise the SM. 
One possibility would be that the second-order perturbative corrections in the non- 
leptonic modes are grossly underestimated and too significant. Another - probably, 
more realistic - is the violation of local duality in the nonleptonic widths; I'll return 
to it a bit later. A priori, one could expect significant, ~ 15% nonperturbative ef- 
fects in the widths. The actual problem lies in the fact that the leading corrections 
are calculated and happen to be small.Q 



4.2 Lifetimes of Heavy Flavor Hadrons 

As stated before, differences between meson and baryon decay widths arise already 
in order 1/toq. The perturbative corrections to the lifetime ratios are completely 
absent. The lifetimes of the various mesons get differentiated effectively first in order 



I/tUq. A detailed review can be found in [ 125 , |126 



Because the charm quark mass is not much larger than typical hadronic scales 
one can expect to make only semi-quantitative predictions on the charm lifetimes, 
in particular for the charm baryons. The agreement of the predictions with the data 
is good. I would even say it is too good keeping in mind that the l/m c expansion 
can hardly be justified. 

As far as the beauty lifetimes are concerned the l/m^ expansion is to be ap- 
plicable. Table [I] contains the world averages of published data together with the 
predictions. The latter were actually made before data (or data of comparable 
sensitivity) became available. 

Data and predictions on the meson lifetimes are completely and non-trivially 
consistent. Yet even so, a comment is in order for proper orientation. The nu- 
merical prediction is based on the assumption of factorization at a typical hadronic 

21 The coefficient for the leading nonperturbative correction is calculated only in the so-called 
leading logarithmic approximation. There are reasons to think that the additional terms in the 
complete 0{a s ) coefficient can be as large. 
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Table 1: QCD Predictions for Beauty Lifetimes 



Observable 


QCD Expectations (1/mj, expansion) 


Ref. 


Data from 126 


r(B-)/r(B d ) 


l + 0.05(/ B /200MeV) 2 


[ 


PI 


1.04 ±0.04 


f(B s )/r(B d ) 


1 ±0(0.01) 




125| 


0.97 ±0.05 


r(A b )/r(B d ) 


>0.9 




125 


0.77 ±0.05 



scale which is commonly taken as the one where a s (/ihadr) — 1- While there is no 
justification for factorization at fi ~ mj, there exists ample circumstantial evidence 
in favor of approximate factorization at a typical hadronic scale - from the QCD 
sum rule calculations, to lattice evaluations, to l/N c arguments. More to the point, 
the validity of factorization can be probed in semileptonic decays of B mesons in an 
independent way, as was pointed out in ||113| . 

The possible effect of the nonfactorizable contribution has been discussed in de- 
tail in ||113| , |127|| . They include also the nonvalence gluon mechanism discussed long 



ago in [|128|| in the simplified language of the quark model. Significant nonfactor- 
izable contributions would in general lead to large effects of weak annihilation in 
D s mesons where experimentally such effects are quite small. Of course, we cannot 
reliably treat in the I/tuq expansion, and — T D + is sensitive to a particular 
combination of the expectation values of a few four-fermion operators. Nevertheless, 
it is a serous indication that the effects due to nonfactorizable expectation values 
must be suppressed. Thus, the estimates of the preasymptotic corrections to the 
widths based on factorization, most probably, are valid within a 50% accuracy. The 
actual uncertainty in f B is not much smaller; it includes the subleading l/m^ effects 
which formally belong to neglected \jvn\ corrections to Tb- 

The recent paper ||129|| attempted to estimate the nonfactorizable expectation 
values in B mesons in QCD sum rules. Although the calculations were carried out 
in a too simplified manner and cannot be really considered as predictions, they 
showed that there hardly exists room for large nonfactorizable contributions, which 



could yield drastic effects conjectures in ||130|| . In particular, I must note that the 



possible interval 0.8 to 1.4 for t b -/t b o claimed by Neubert ||131|| is unacceptable. 
While one cannot guarantee a priori that the onset of the 1 / rriQ expansion is safely 
below rrib and the latter can be applied to actual b hadrons, it is certain that such 
large deviations cannot be obtained in the framework of the self-consistent I/tuq 
expansion of the width itself. The quoted interval emerged merely as a result of 
reckless manipulation with ad hoc introduced hadronic parameters. Their values 
saturating such corrections would violate certain bounds ||132|| which are analogous 
to the unitarity constraints. In any case, these flavor-dependent l/rrig effects under 
discussion originate only in particular quark decay channels. It is clear that the 
preasymptotic effects must constitute only a small fraction of the widths in order to 
be amenable to the 1/mg expansion. 
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The agreement of the data on B meson lifetimes with experiment is obscured by 
the apparent conflict for T^ b /r Bd . To predict the 1/ttIq corrections to T\ b in the l/m b 
expansion one needs to evaluate the baryonic expectation values of two operators, 

A a A a 

(A b \bbu lo u\A b ) , (A b \b—bu lo —u\A b ) . (137) 

They do not have the usual factorizable contribution, and their values are rather 
uncertain. Nevertheless, it was shown that their contributions cannot be too large 
132|| and the maximal effect in T^ b does not exceed 10-12%. To achieve the larger 



corrections one would have to go beyond a usual description of baryons when light 
quarks are "soft". This agrees with the fact that the constituent quark model 
estimates typically yield about 3 to 5% enhancement ||133|| . A similar conclusion 
has been reached by the authors of Ref. ||134|| who analyzed the relevant baryonic 



matrix elements through QCD sum rules. 

We see that there are indications of disagreement between theory and experi- 
ment, at the level of at least two quoted experimental error bars. The opinion is 
sometimes expressed that before a disagreement reaches three standard deviations, 
it is premature to start worrying. I am not sure that it fully applies. On the one 
hand, the literal theoretical expectations for T\ b /rB d concentrate closer to 0.95, and 
the quoted limit 0.9 is not a central theoretical value. On the other hand, these 
theoretical predictions were known already for a number of years [125] , so I hope 
that the experiments have analyzed obvious possible biases in the data and corrected 
those which could generate the enhanced gap. Neither argument is convincing by 
itself, but together they give me a feeling that the disagreement does exist. 

If it will be firmly established in the future experiments, I think that in the 
framework of the SM the most probable explanation is the violation of duality. As 
a matter of fact, its significance in nonleptonic widths is theoretically expected a 
priori. Indeed, the expansion parameter for the widths is not m b directly but rather 
the energy release which is noticeably smaller in b — > c. Moreover, the preasymp- 
totic corrections depend on the concrete form of the weak interaction involved. For 
the four-fermion interaction they are enhanced: the large-5 arguments [[3^] men- 
tioned earlier suggest that the actual scale parameter is smaller, ~ E re \/5. In the 
semileptonic decays this does not deteriorate the expansion since it is automatically 
protected by the heavy quark symmetry when m c increases. Even if we keep the 
invariant mass of leptons ^fcp fixed, the corresponding quark width remains correct 
at least up to 1/rriQ terms for arbitrary, for example, maximal q 2 when the energy 
release is tiny, see Eq. ( |131| ). 

For the nonleptonic decays the heavy quark symmetry does not generally apply, 
and at insufficient energy release one expects significant violations of duality. A few 
years ago, therefore, we would not be too much concerned with present discrepancies. 
Now, on the other hand, we have evaluated the leading terms in the 1/m expansion 
of the width and did not find indications for significant effects. Since violation of 
duality is conceptually related to the asymptotic nature of the OPE ||135| , |3~3| , we 
could have expected that duality works at a few percent level as well. 
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4.3 Expansion and Duality Violation 

The problem of duality violation attracts more and more attention of those who 
study the heavy quark theory; a recent extensive discussion was given in f33|. The 



expansion in 1/ttiq is asymptotic. There are basically two questions one can ask 
here: what is the onset of duality, i.e. when does the expansion start to work? 
The most straightforward approach was undertaken in Ref. [136|, and no apparent 



indication toward an increased energy scale was found. Another question, how is the 
equality of the QCD parton-based predictions with the actual decay rates achieved, 
was barely addressed. Though a relevant example of such a problem is easy to give. 

The OPE ensures that no terms ~ 1/tuq can be in the widths in QCD and the 
corrections start with 1/wJq. However, the OPE per se cannot forbid a scenario 
where, for instance, 

C ^V 1 ' '~ A 5™' (138) 

1 h q rriQP 

In the actual strong interaction, m& and m c are fixed and not free parameters, so, 
from the practical viewpoint these types of corrections are not too different - but the 
difference is profound in the theoretical description! It reflects specifics of the OPE 
in Minkowski space, and such effects can hardly be addressed, for example, in the 
lattice simulations. Their control requires a deeper understanding of the underlying 
QCD dynamics beyond the knowledge of first few nonperturbative condensates. 

In fact, the literal corrections of the type of Eq. (|138|) are hardly possible; the 
power of 1/ rriQ in realistic scenarios is larger, and these duality corrections must be 
eventually exponentially suppressed though, probably, starting at a higher scale |33| . 
But a theory of such effects is still in its embryonic stage and needs an additional 
experimental input as well. 

Nevertheless, we know that the duality-violating terms in the HQE are not com- 
pletely arbitrary, and they must obey certain constraints. For example, they cannot 
be monotonous and positive or negative but oscillate, etc. All these exact properties 
are often missed in the theoretical discussions. 

It is worth adding that the folklore about violations of duality is often rather 
superficial. Even the used terminology is not always consistent. It was written 
recently, for example, in [|105|| that the assumptions behind the I/tuq expansion for 
the widths, in particular, for the nonleptonic widths, include a certain factorization 
of the decay amplitudes. In fact, such a factorization is not required at all, like it is 
not required in the asymptotic expansion of R e + e -(s). The only assumption is the 
usual factorization of contributions of large-distance and short- distance physics, the 
OPE. Moreover, in principle it is enough to assume it only in Euclidean domain. A 
stronger assumption of such a factorization for actual decay processes in Minkowski 
kinematics at fixed energy would lead to additional constraints on violation of local 
duality. 

It has been quite popular to state that the quark-hadron duality sets in only 
when the number of hadrons in the final state becomes large ||137|| . This is a wrong 
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criterium. The semileptonic decays in the SV kinematics are a nice example. In the 
limit 

m b ~ m c 

m b , m c > A QCD , m b - m c > A QCD , ■ ~ v < 1 (139) 

nib + m c 

the exact quark-hadron duality perfectly holds with the theorem-like rigorousness 
while the total decay rate is saturated by only two final states, D* and D (even only 
one state can be retained if special weak currents are considered). Simultaneously, 
the yield of the excited states can be arbitrary small: it is driven by two parameters 

(Aqcd/^c) 2 < 1 and ( ^~^ ) < 1. This fact was actually noted in |20f in the mid 



80's and was among the first steps in the development of the heavy quark symmetry 
and dynamic 1/rriQ expansion. It is really surprising why this classic result used to 
be ignored even in the relatively recent papers. 

On the other hand, the duality violation has peculiarity in decays of heavy 
quarks. Apart from the usual assumption of local duality, the practical applications 
of the OPE expansion relies here on the property of global duality, which can also 
be violated to a certain extent. An explanation of what is global duality would lead 
me too far astray; it can be found in Ref. |18|], Sect. IV. A where this notion was 
introduced. Let me only note that since then this term has often been abused in the 
literature. For example, it is tavtological to discuss the violations of global duality 
in R e + e - or in the hadronic r decay width (more accurately, it would be merely 
verifying the dispersion relation). Everything there is local duality and the global 
duality is a very simple statement, a direct consequence of analyticity and unitarity. 

I come to the end of my lectures without a formal concluding section. Theory of 
heavy quarks has become a well-developed field of QCD. Not only it benefits from 
the power of theoretical methods elaborated in QCD over 25 years of evolution; 
the application of the heavy quark expansion provides feedback for the theory of 
QCD itself. A few slices of the heavy quark theory I discussed here demonstrate 
that it is an actively evolving field where many interesting questions still are to be 
understood. Without doubts, the data from the new generation of experiments will 
provide the new impetus; we all are looking forward to new fascinating surprises. 
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